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Abstract 

The  major  objectives  of  this  summer  research  project  were  to:  (1)  evaluate  some  of  the 
currently  available  software  engineering  tools  for  developing  and  maintaining  parallel 
software,  (2)  parallelize  a  computationally  intensive  portion  of  serial  code  contained  within 
the  Air  Force's  data  fusion  model  and  (3)  propose  new  and/or  enhanced  tools  and 
methodologies  to  aid  in  the  development  and  maintenance  of  parallel  software.  Objectives 
(1)  and  (2)  were  accomplished  concurrently  by  employing  two  software  engineering  tools 
to  the  process  of  parallelizing  a  small  portion  of  the  serial  data  fusion  model.  The  primary 
focus  here  was  not  so  much  on  the  resulting  parallel  software,  but  rather  on  the  process 
involved  in  developing  the  parallel  code.  The  tools  were  found  to  be  useful  aids  in  the 
sense  that  they  either  simplified  or  automated  certain  low-level  aspects  of  developing 
parallel  software;  thus  allowing  for  rapid  testing  and  prototyping  of  various  system-level 
software  designs.  Future  research  in  the  area  of  parallel  software  engineering  tools  and 
techniques  is  suggested  at  the  end  of  this  report. 
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1.  Introduction 

A.  Background 

Over  the  past  several  decades  the  state  of  software  engineering  for  serial  computers  has 
matured  into  a  quite  sophisticated  science.  In  contrast,  the  unique  software  engineering 
problems  associated  with  developing  and  maintaining  softv/are  for  massively  parallel 
computers  have  only  recently  begun  to  surface  and  be  seriously  addressed.  While  the 
concept  of  parallel  computing  is  as  old  as  the  era  of  electronic  confuting,  only  within  the 
past  decade  have  conunetcially  available  and  cost-effective  parallel  computing  systems 
become  available  [6].  Ih  February  1985  the  Intel  Corporation  announced  the  iPSC,  a 
computer  based  on  a  (then)  new  structure  called  the  hypercube  architecture  [6].  Early 
demonstrations  showed  that  for  certain  applications,  a  128-processor  hypercube  could 
sustain  a  processing  rate  of  10  MFLOPS  (million  floating-point  operations  per  second); 
roughly  one-fifteenth  the  rate  of  the  (then)  fastest  programs  on  the  Cray  X-MP-2X,  at  one- 
twentieth  the  cost  In  1986  Thinking  Machines  Corporation  announced  the  Cbnnection 
Machine,  consisting  of  between  16  thousand  and  64  thousand  processors  arranged  on  a 
very  high  speed  network.  While  individually  the  nodes  were  not  very  powerful, 
collectively  they  could  handle  many  symbolic  computations  at  very  high  speeds  [6,7]. 

Some  parallel  computers  were  built  as  far  back  as  the  1960s,  although  most  of  these 
were  more  of  a  laboratory  curiosity  than  a  practical  and  cost-effective  means  of  doing  high- 
performance  computing.  Most  notable  of  these  early  machines  was  the  ILLIAC  IV 
computer,  constructed  at  the  University  of  Illinois  in  the  late  1960s.  While  limited  memory 
and  expensive  hardware  prevented  proliferation,  it  inspired  much  interesting  work  in 
parallel  algorithms.  There  have  been  many  parallel  machines  built  (both  commercially  and 
in  research  laboratories)  since  and  prior  to  Intel's  announcement  of  the  iPSC.  For  a  more 
comprehensive  survey  of  the  evolution  of  parallel  computing,  the  reader  is  referred  to  the 
following  references:  [6,7,8]. 

B.  Motivation 

The  following  quote  is  taken  from  an  editorial  written  by  Peter  J.  Denning,  past  editor-in- 
chief  of  the  Communications  of  the  ACM,  "...  just  beyond  the  hardware  speed  barrier  lies 
another:  the  software  barrier.  We  are  good  at  understanding  sequential  algorithms,  but  we 
have  little  e  qrerience  with  algorithms  that  direct  the  activities  of  parallel  processors.  We  do 
not  know  how  to  program  the  new  [parallel]  machines!"  [6].  To  date,  most  of  the 
successes  in  obtaining  significant  speed-up  by  using  parallel  processing  have  been  limited 
to  fairly  specific  application  areas  such  as  signal  processing,  image  processing,  and 
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con^)utaddn  of  certain  classes  cf  muldvariable  funcdcns.  There  has  been  some  progress  in 
the  area  of  paralielizing  compilers  (compilers  which  attempt  to  automatically  parallelize 
conventional  serial  code),  however,  it  is  not  clear  if  this  approach  alone  will  ultimately 
prove  to  be  sufficient  Many  researchers  believe  that  a  combination  of  several  different 
approaches  will  eventually  be  required  in  order  to  develop  efficient  parallel  software 
development  techniques  flexible  enough  for  "general  purpose"  computing.  The  goals  of 
this  project  were  to  try  and  tie  together  some  currently  available  parallel  software 
engineering  tools,  and  suggest  directions  of  future  research. 

C.  Organization  of  the  Report 

The  report  is  organized  in  the  following  manner.  In  Section  H,  a  brief  description  of  the 
software  engineering  tools  utilized  and  some  background  material  on  the  data  fusion 
problem  are  given.  In  Section  HI  we  report  on  the  process  of  applying  the  software 
engineering  tools  to  the  task  of  parallelizing  a  small  portion  of  the  serial  data  fusion  code. 
Section  IV  proposes  some  directions  of  fiituie  work.  The  Appendix  contains  some  selected 
program  listings. 

n.  Description  of  the  Tools  and  the  Data  Fusion  Problem 
A.  Parallel  Proto  (PProto) 

According  to  its  developers,  "PProto  is  a  rapid-prototyping  environment  that  supports  the 
software  engineer  in  the  definition  of  software  system  specifications  and  the  prototyping 
and  evaluation  of  specifications  and  designs"  [5],  In  the  paragraphs  below  we  briefly 
describe  four  of  the  basic  components  of  PProto:  (1)  software  simulation,  (2)  hardware 
simulation,  (3)  mapping  software  to  hardware  and  (4)  performance  analysis.  For  a  more 
detailed  explanation,  the  reader  is  referred  to  the  PProto  User's  Manual  [5]. 

PProto  uses  both  graphical  and  textual  constructs  in  the  description  and  simulation  of 
the  system  software.  At  a  high-level,  the  software  is  described  graphically  using  nodes  and 
links.  The  nodes  represent  processes  and  the  interconnecting  links  represent  the  flow  of 
control  and/or  data  messages  between  the  processes.  Contained  within  each  node  is  a 
description  of  its  "behavior,"  which  is  written  in  a  prototyping  language  called  SSDL 
(System  Specification  and  Design  Language).  Process  nodes  and  links  are  created 
graphically  by  simply  selecting  the  appropriate  item  from  a  graph  editor  menu.  To  edit  the 
behavior  of  a  process  node  (i.e.,  the  SSDL  program),  the  node  is  first  selected  followed  by 
the  selection  of  the  behavior  editor,  which  brings  up  a  window  for  typing  in  the  SSDL 
behavior  for  that  node. 
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PProto  also  allows  the  user  to  create  a  model  of  the  target  hardware.  For  this,  a 
hardware  graph  editor  is  provided  which  operates  in  a  similar  fashion  to  the  graphical  editor 
for  creadng  the  software  model  In  the  hardware  graph,  the  nodes  represent  processors 
and  the  links  represent  actual  hardware  busses.  The  relative  speeds  of  the  processors  and 
busses  can  be  assigned  by  selecting  the  appropriate  item  horn  a  menu.  The  hardware  editor 
also  models  memory  modules,  for  which  relative  memory  access  times  may  be  assigned. 
The  hardware  graph  editor  allows  the  user  to  quickly  construct  and  edit  the  target  hardware. 

One  of  the  very  useful  aspects  of  PProto  is  that  it  allows  the  user  to  view  the  system 
software  independent  from  the  hardware.  So,  once  the  software  and  hardware  models  are 
in  place,  the  mapping  editor  is  then  used  to  map  the  processes  (software)  onto  physical 
processors  (hardware).  By  separadng  the  hardware  ftom  the  software,  the  user  can  (at  least 
initially)  focus  on  the  comecrness  of  the  software.  Later,  by  simulating  various  mappings 
onto  a  fixed  hardware  model,  for  example,  the  user  can  fine-tune  the  initial  software  design 
(if  necessary)  so  as  to  achieve  better  overall  performance  and  utilizations. 

PProto  provides  various  forms  of  output  from  the  simulation  which  aid  the  software 
engineer  in  uncovering  any  bottlenecks  (in  both  hardware  and  software)  that  may  result 
from  a  particular  mapping.  The  simulation  itself  is  animated  in  the  sense  that  the  software 
process  nodes  change  color  (from  blue  to  red)  whenever  a  behavior  within  a  process  is 
executing.  The  interconnecting  data/control  links  also  change  color  when  a  message  is 
transmitted  across  them.  The  hardware  graph  is  animated  as  well  with  giaphiral 
performance  meters  placed  beside  the  processors  and  busses  showing  the  percentage  of 
time  each  piece  of  hardware  is  used.  The  overall  execution  time  (measured  in  simulation 
time  units)  is  displayed  as  the  simulation  is  running.  At  the  end  of  the  simulation  a 
summary  text  file  is  printed  to  standard  output. 

B.  Computing  Surface  Tools  (CSTools) 

CSTools  is  a  collection  of  support  programs  supplied  by  Meiko  Scientific  Inc.  for  use  with 
their  transputer  based  parallel  processing  systems.  The  transputer  system  used  was  the 
MK202,  which  consists  of  a  bank  of  sixteen  T8(X)  transputers  (in  the  computational 
engine)  hosted  by  a  Sun  4/470  workstation.  The  unique  property  of  the  transputer  system 
is  that  the  interconnection  topology  (between  the  transputer  nodes)  can  be  contro^’ed  via 
software  commands,  provided  that  the  in-  and  out-degree  at  each  node  does  not  exceed 
four.  The  term  computing  surface  refers  to  the  set  of  transputer  nodes  and  interconnecting 
hardware  busses  used  by  a  particular  application.  One  of  the  tools  (within  CSTools) 
enables  the  user  to  either  explicitly  define  the  interconnection  topology  or  allow  a  generic 
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topology  to  be  chosen  by  the  CSTools  operating  system.  Examples  of  common 
interconnection  topologies  are  the  binary-tree,  the  four-neighbor  mesh  and  the  linear  array. 
There  is  also  a  tool  which  enables  the  user  to  download  parallel  programs  from  the  host 
machine  onto  the  computing  surface  nodes.  We  point  out  here  that  neither  of  these  tools  are 
graphically  based  and  are,  therefore,  a  bit  cumbersome  to  use.  Better  user  interfaces  for 
these  tools  would  be  graphical  ones  like  those  implemented  in  PProto  for  describing 
software,  hardware,  and  the  mapping  between  them.  This  issue  will  be  addressed  in  a 
latter  section  on  future  directions  of  research. 

There  are  also  tools  within  CSTools  to  aid  the  programmer  in  setting-up 
communication  protocols  for  exchanging  messages  between  processors.  The  programming 
environment  associated  with  these  communication  protocols  uses  the  concept  of  a 
confuting  surface  network  (CSN),  which  allows  the  programmer  to  abstract  away  from 
the  physical  interconnection  topology  associated  with  a  particular  configuration  of  the 
computing  surface.  From  the  programmer's  point  of  view,  messages  are  sent  from  an 
origin  node  to  a  destination  node  via  a  transport  through  the  CSN.  The  CSN  is  responsible 
for  the  actual  store-and-forward  routing  associated  with  passing  the  messages  through  the 
physical  interconnection  topology. 

CSTools  supports  four  types  of  protocols  for  inte:processor  communication:  (1) 
blocking-synchronous,  (2)  blocking-asynchronous,  (3)  nonblocking-synchronous  and  (4) 
nonblocking-asynchronous.  In  the  blocking  protocols,  control  is  not  returned  to  the  caller 
process  (i.e.,  the  sender  of  the  message)  until  "completion"  of  the  transmission  (thus, 
execution  at  the  caller  process  is  blocked  until  an  acknowledgement  is  received).  For  the 
synchronous  cases,  "completion"  is  defined  whenever  the  message  is  successfully  received 
at  the  destination  node,  while  in  the  asynchronoi's  cases  "completion"  is  defined  when  the 
message  successfully  enters  the  CSN.  In  the  nonblocking  cases,  control  is  given  back  to 
calling  process  immediately. 

C.  The  Data  Fusion  Problem 

Data  fusion  encompasses  a  very  broad  range  of  problems  and  is  therefore  difficult  to  define 
precisely.  However,  within  the  domain  of  Air  Force  applications,  data  fusion  usually  refers 
to  those  settings  in  which  vast  amounts  of  sensor  data  (collected  from  remote  locations)  is 
sent  to  a  central  location  for  processing  and  interpretation.  The  central  processing  location 
is  referred  to  as  the  fusion  center.  The  underlying  objective  of  the  fusion  center  is  to 
"make  sense"  of  the  incoming  data  in  a  real-time  or  near  real-time  fashion.  Depending  on 
the  specific  application,  the  definition  of  "making  sense"  of  the  data  varies.  The  data  fusion 
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moddstaSedm^yic^fia^BEnaniyiaixibitsMia^aesdcemis^gmxxsss^diJi&oea 
zemoic  elecaoBoc  sesson.  The  daia  tepotted  by  Ac  tanons  sassoss  psovldc  the  fossoo 
center  widi  a  pece  of  icidfigoice  icfonoa&n  aboci  vanoezs  smEiasy  isgess  (sodi  as  tssgef 
postkm  and/or  vekidiy  cstimaTes).  Fbrddspngect  wefoccsed  on  ^Advanced  Sensor 
Bq)kHtalioo  (ASE)  mode^  wiudi  is  acanem  data  fosioo  model  wiihin  dse  Air  Forces  A 
fmictk»al  overview  of  the  ASEciodd  is  shown  in  Hg.  1. 

Data  func»  laobknis  are  inherently  coa^atsaooaiJly  intensive  for  several  reasons. 
First,  doe  to  the  massive  amoont  incQonng  data  to  be  processed  in  a  Q-pkal  scenetio, 
even  die  an^lesr  data  ioson  algoiidims  may  safifer  finom  slow  execution  on  coovendcnal 
serial  con^mters.  Second,  die  accmacycrf^  modi  of  die  inconnng  data  m^  be  very  low, 
dius  die  fimwialgoridmis  most  be  soi&isdcaiedeooD^  to  make  intdligentdedsons  based 
on  rjeasurements.  Hnally,  not  <niiy  is  there  a  variance  in  die  accnracy  of  die  data,  but 

pordons  if  incoming  data  records  m^  be  mis^g  or  incon^lete. 


Fig.  1:  A  functional  overview  of  the  ASE  data  fusion  model. 


III.  The  Parallelization  Process 

In  this  section  we  describe  the  process  taken  to  parallelize  a  small  portion  of  the  data  fusion 
program.  It  should  be  understood  that  the  approach  taken  is  by  no  means  the  only 
possibility,  and  we  do  not  claim  that  the  overall  approach  taken  is  the  "best"  possible 
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soSmaoo  (as  weasaad  by  aay  pamcsbEr  sofiwaie  ea^ziming  lomk).  Osr  sodo^-mg 
oSgecihTwasiociap3ayaB^)pn]^ia«bxiidieavaiIa!^ax&coGiIdbee«a!ii£e^  Asof 
die  wiiiiiig  of  dns  lepaa,  sosxse  of  me  laitgr  st^es  of  tise  pnxress  woe  soil  oo-gmng, 
howem;cnoii^  progress  was  made  (oreaqieikoce  gained)  to  repoeioaaB  stages. 

A.  HigftLevel  UnderstanSng  (ftkeFmcaonalay 

Tbe  csnent  data  fbsioo  modd  cootains  neaity  SOOLOOO  lines  of  serial  FCKITRAN  code.  To 
make  our  project  roeamagfiil,  we  needed  tt)  extract  a  ponion  of  die  code  whidi  was  bodi 
paialldizatte  and  manageaMe-Becanse  we  <fid  not  have  an  automated  tool  for  derecdng 
paiandism  conidned  widiin  serial  FORTRAN  code,  we  resorted  to  the  task  of  karoing 
some  of  die  fai^level  foncdonali^  of  the  data  fbsico  modd.  After  qipioximatdy  two 
wedES  of  reading  docmnentaiuHi  and  talking  with  pet^le  diiecdy  involved  in  the  original 
devekqmient  of  the  model,  the  coiidation  node  was  chosen  as  a  prime  candidate  for 
paralle^  zaiion.  The  conelaiicKi  node  was  compntaiicHiaUy  intensive  and  believed  to  be 
highly  paiaUelizable.  Within  the  conelation  node  there  is  a  module  called  enti^' 
assodation.  The  purpose  of  entiQr  associarirai  is  to  detennine  whether  a  new  (manning) 
target  repmt  “associates**  with  known  targets  stored  in  the  data  base.  If  it  does,  then  a 
confidence  measbre  is  updated  within  die  rqxxt  reond  stored  in  the  data  base.  On  die  odier 
hand,  if  the  new  target  rqicn  does  not  associate  widi  any  of  the  reports  in  the  data  base, 
dum  it  is  stored  as  a  new  "candidate"  target  record  in  the  data  base. 

The  underlying  measure  of  assodarion  (MOA) — which  is  used  to  determine  the 
"closeness”  between  two  target  rqioits— is  derived  fimn  a  stadsncally  based  measure  called 
the  Mahalanolnan  metric.  The  MOA  value  is  a  real  number  between  zero  and  one.  An 
MOA  value  of  zero  implies  that  two  reports  are  not  assodated,  while  a  value  of  one 
represents  the  highest  possible  degree  of  association.  The  computation  of  the  MOA 
between  two  reports  requires  a  posidon  vector  and  covariance  matrix  fiom  each  iqxirt  The 
mathemadcal  formula  for  computing  the  MOA  is  given  by 

MOA  =  exp  [1/2  (r,  -  R  (x,  -  )], 

where  andx^  doiote  posidon  vectors  of  a  target  report  (from  the  sensors)  and  a  candidate 
report  (from  the  data  base),  respectively.  The  variable  R  is  a  matrix  defined  by 

R  =  R|  +  Rq  , 

where  R^  and  R^.  are  the  covariance  matrices  for  the  target  and  candidate  reports, 
respectively.  The  covariance  matrices  capture  the  uncertainty  inherent  in  the  position  data  as 
measured  by  various  types  of  sensors.  More  detml  on  the  theory  behind  the  Mahalanobian 
metric  can  be  found  in  reference  [1]  and  the  references  therein. 
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B.  Serial  Sc^4tre  Mock-Up 

Wiimn  liie  assodaiioc  modak  (rf*  Ac  data  fii^n  model,  tbae  is  a  sectioa  cods 

vriach  dettsmises  tbs  tnaxTimim  MOA  vales  between  an  incosning  target  nqM3it  and  a 
odlectioa  of  caodBdaie  iqxKts  stoced  in  die  fiisoo  cenier?  data  base.  Oer  starting  {x^  for 
dns  pccyect  was  to  assume  that  die  coDecdon  candidate  rqxxts  bad  been  sekesed.  Thus, 
die  poedonrfeode  of  interest  was  that  part  wfaidi  finds  the  maxinigtii  value  of  MOA.  The 
structure  <rf  die  serial  code  for  finding  die  nBMmmn  MOA  is  shown  in  Rg.  2. 

begin  maximum  MOA  calculation 
do  for  each  target 

do  for  each  candidate 
confute  !-50A 

if  5^A  >  current  max  t^A  then 
current_max_MOA  :=  MOA 
end  if 
end  do 

update  data  base 
end  do 

end  maximum  MOA  calculation 
fig.  2:  The  structure  of  the  serial  code  for  finding  maximum  MOA  values. 

We  wrote  three  FORTRAN  programs  for  emulating  the  above  Maximum  MOA  part 
of  the  serial  model.  Two  of  the  pro^ams  simply  create  synthedc  target  and  candidate  data 
files.  The  third  program  actually  computes  maximum  MOA  values  between  target  and 
candidate  data  records.  An  overview  of  this  serial  system  is  shown  in  fig.  3.  Due  to  space 
limitations,  we  shall  not  describe  the  details  of  the  actual  serial  FORTRAN  programs  nor 
the  data  stnictures  used  to  store  the  target  lecwds,  however,  some  selected  program  listings 
and  output  files  can  be  found  in  the  appendix.  The  serial  machine  used  was  a  VAX/780,  as 
this  machine  was  also  the  target  machine  used  to  execute  the  actual  ASE  riKxlel. 
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Rg.  3:  Ovcivicw  of  the  saial  software  mock-up. 


C.  The  Basic  Parailetizadcn  Strategy 

Because  our  goal  was  to  ultimately  implement  the  parallel  software  on  the  transputer 
system,  the  following  strategy  was  develq)ed  under  the  assumption  that  the  target  machine 
wcMtld  have  a  distnbuted  memory  architecture. 

The  simple  looping  structure  associated  with  the  serial  code  (for  computing  maximum 
MOA  values)  made  the  task  of  deciding  on  a  basic  parallelization  strategy  fairly 
straightforward.  We  decided  on  the  following  approach.  (1)  Partition  the  target  records 
evenly  across  the  nodes  (i.e.,  store  equal-sized  chunks  of  target  data  in  each  local  memory 
module).  (2)  Pipeline  candidate  records  through  the  network.  (3)  Whenever  a  node 
receives  a  candidate  record,  it  updates  its  current  maximum  MOA  values  (one  for  each 
target  record)  and  forwards  the  candidate  record  to  the  appropriate  neighbor(s)  in  the 
broadcast  tree. 
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r.  Parallel  Si^tvareSystttn  Design  (PProio) 

We  used  PRcoto  to  evaloaie  the  roexit  of  our  fosic  parallelizaiion  strategy  at  a  system 
scrftwaie  leveL  T»o  disdnct  i7>p!2iiieataiions  of  die  ba^  strategy  were  omsideied.  In  the 
first  inqilemeniaiion  the  p.’ocessors  (and  processes)  were  connected  as  a  innaiy  tteeand  in 
tbesecf^tht^wKeoxiiiectedasaline?'  array.  Hgs  4  and  5  show  the  graphical  views  of 
these  two  inqilementadoos  in  both  hardware  and  software.  Note  that  the  lone^  nodes  in 
eadi  figure  rqgesent  die  host  machine  and  the  other  nodes  denote  the  tran^uteis.  In  the 
software  graphs  note  that  diere  are  many  mrae  links  than  in  the  hardware  graphs,  because 
die  host  node  must  have  a /iogica/  OMuiectioa^mn  itself  to  each  other  node.  These  lo^cal 
connections  are  used  to  seed  the  chunks  of  target  data  ftom  the  host  down  to  each 
tran^uter.  Also,  there  are  lo^cal  links  ftom  each  transputer  node  to  the  host  which  are 
used  by  the  transputers  to  send  their  final  results  to  the  host  In  the  hardware  graphs  the 
links  represent  physical  busses  between  the  actual  processes. 
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Fig.  4:  Graphical  view  of  the  binary  tree  implementanon  on  PProto. 


Fig.  5:  Graphical  view  of  the  linear  array  implementation  on  PProto, 


The  behavior  of  the  root  node  in  the  binary  tiee  and  the  first  node  in  the  linear  array 
were  almost  identical,  the  exception  being  that  in  the  binary  tree  the  candidate  record  was 
passed  to  the  two  children,  whereas  in  the  linear  array  the  candidate  record  was  passed 
down  to  the  next  neighbor.  The  behaviors  of  the  intermediate  nodes  and  last  node(s)  in 
both  implementations  also  had  obvious  similarities.  The  behavior  of  internal  nodes  from 
both  implementations  are  also  shown  in  Figs.  4  and  5  (the  textual  portions). 

We  ran  some  tests  on  the  two  implementations  and  observed  that  the  binary  tree 
implementation  ran  about  10%  faster  than  the  linear  array  (in  terms  of  PProto's  simulation 
time  units).  In  these  experiments  we  used  data  files  containing  70  target  records  and  100 
candidate  records.  It  was  noted  that  had  many  more  processors  been  employed,  then  the 
advantage  of  using  the  binary  tree  structure  over  the  linear  array  would  become  more 
apparent,  because  the  time  for  "filling  the  pipe"  on  the  binary  tree  is  proportional  to  the 
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logaiitiim  of  the  nsmber  of  processors  while  the  filling  time  for  the  linear  array  grows 
linearly  with  the  number  of  processors.  With  only  seven  parallel  processors,  this 
a^nqxotic  effect  was  not  a  dominant  factor. 

E.  Parallel  Software  ImpIementaaon(CSTools) 

Uns  portitm  of  the  parallelization  procedure  was  not  completed  by  the  end  of  the  project 
p^od.  However,  we  did  gain  experience  with  programming  the  transputers  for  a  different 
application,  and  therefore  we  can  briefly  explain  the  expected  transition  from  the  ^stem 
software  design  to  actual  imploneotation  on  the  transputers.  First,  the  hardware  can  be 
configured  as  dther  the  binary  tree  or  linear  array  using  the  appropriate  tool  from  CSTocIs. 
Second,  the  computational  part  of  the  behaviors  from  PProto  can  be  converted  to  either  C 
or  FORTRAN  easily  (requires  simple  looping,  math  operations,  and  comparisons). 
Hnally,  die  communication  portions  of  the  behaviors  should  also  be  fairly  straightforward 
to  implement  with  the  CSTools  protocols.  In  particular,  the  transmitting  of  data  out  through 
a  port  in  the  SSDL  behaviors  is  accomplished  by  a  blocking  transmission  through  a 
transport,  as  defined  under  the  CSTools  programming  environmenL 

IV  Directions  of  Future  Work 

A.  A  Suggested  Enhancement  to  PProto. 

From  our  experience  with  Pftoto,  it  appears  that  the  delay  associated  with  sending  a 
message  across  a  communication  bus  is  independent  of  the  size  of  message  being  sent  In 
our  implementations  of  the  maximum  MOA  calculation,  we  assigned  a  relative  speed  of 
unity  to  the  busses  and  discovered  that  only  one  time  unit  was  charged  for  sending  either  a 
chunk  of  10  target  records  or  a  single  candidate  record.  While  PProto  is  a  system  software 
tool,  we  believe  that  a  finer  level  of  uetail  should  be  incorporated  in  estimating  the 
communication  delays. 

B.  A  Suggested  Enhancement  to  CSTools 

The  current  tools  within  CSTools  for  defining  the  interconnection  topology  and  mapping 
the  processes  onto  processors  require  the  user  to  type-in  the  correct  commands.  These 
commands  are  quite  detailed  and  can  become  confusing  when  the  number  of  processors 
used  exceed  3  or  4.  We  recommend  that  a  graphical  method  be  implemented  based  on  the 
same  techniques  developed  by  the  designer  of  PProto.  So,  the  user  would  have  a  graphical 
view  of  the  hardware  and  software  and  a  mapping  editor  for  assigning  processes  to 
processors.  The  implementations  of  this  suggestion  would  involve  merging  a  portion  of 
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the  gr^faical  technology  developed  for  PProto  v/ith  the  cunent  CSTools. 

C.  A  Tool  for  Balancing  Commumcadon  and  Computauon  in  Parallel  Systems 
Thoe  seeots  to  be  a  gap  in  the  availabili^  of  tools  or  techniques  for  aiding  the  software 
en^eerin  striking  a  balance  between  the  computational  and  communicational  aspects  of 
developing  parallel  software.  Through  our  own  experiences  in  pa^el  programming,  we 
have  found  ihat  if  the  ratio  of  communication  to  conqiutation  is  too  high,  then  substantial 
bottlenecks  vasy  arise  which  can  degrade  overall  perftsmance  significandy. 

At  die  higher  levels  of  developing  parallel  software,  die  software  engineer  may  not  be 
too  concerned  with  the  communicadon  requirements  and  overtiead  of  the  software  system. 
Instead,  issues  of  conecmess  and  parddoning  of  code  and  data  are  usually  considered. 
Eventually,  however,  when  the  parallel  program  is  implemented  on  a  pardcular  piece  of 
hardware,  the  software  engineer  may  find  severe  botdenecks  caused  by  contendon  for  the 
commuidcadon  resources.  For  example,  if  a  pardcular  software  process  transmits  an 
average  of  100  bytes  of  data  per  every  1  msec  of  computadon  (an  average  of  1(X) 
Kbytes/sec)  across  a  communicadon  chaimel  having  a  capaciQr  of  only  10  Kbytes/sec,  then 
obviously  queues  will  eventually  overflow  and  potendally  cause  botdenecks  and/or 
deadlocks. 

A  quesdon  now  arises.  Should  the  software  engineer  by  responsible  for  coundng 
clock  cycles  of  computadon  between  conununicadons  to  ensure  that  the  software  executes 
efficiendy  on  the  target  hardware?  We  believe  the  answer  is  no  for  several  reasons.  First, 
such  an  approach  would  force  the  software  engineer  to  "commit"  to  a  pardcular  parallel 
machine,  and  therefore  design  the  software  around  the  hardware  specifications.  Second, 
such  an  approach  is  simply  too  tedious.  Hnally,  such  an  approach  does  not  lend  itself  to 
adapting  the  software  to  execute  on  different  machines. 

We  propose  to  develop  a  tool  to  aid  the  software  engineer  in  controlling  the  flow  of 
transmissions  so  as  not  to  overload  the  communication  network.  The  benefits  of 
employing  a  flow  control  mechanism  are  well  understood  in  the  areas  of 
telecommunication/data  networks  and  automobile  traffic  control.  For  instance,  in 
telecommunication  networks,  certain  callers  may  be  denied  network  access  whenever  the 
utilization  of  communication  resources  exceeds  a  certain  threshold.  If  properly  controlled, 
this  throtding  of  network  access  will  result  in  better  overall  throughput.  Similarly,  the 
proper  use  of  traffic  lights  can  have  the  same  effect  by  controlling  the  flow  of  traffic  on  city 
streets.  Our  approach  is  to  employ  similar  techniques  in  the  control  of  traffic  between 
parallel  processors. 
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Appendix 

Selected  Program  Listings  and  Outputs 
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Frograun  File  Nzune:  MOA.FOR 
Programnier :  John  Antonio 
Date:  June  19^  1991 

Description:  This  program  coo^utes  the  maximimi  measure  of 

association  (MOA)  between  a  set  of  3cnown  candidates 
and  a  set  of  target  reports*  For  each  target  report, 
the  candidate  having  the  maximum  MOA  is  determined  and 
written  to  an  output  file.  Each  camdidate  amd  target 
record  consists  of  five  items:  an  identification  number, 
an  x-coordinate,  a  y-coordinate,  a  standard  deviation  of 
error  in  the  x  direction,  and  a  standard  deviation  of  error 
in  the  y-direction.  Each  record  of  the  output  file 
consists  of  three  items:  the  identification  number  of 
the  target,  the  maximum  MOA  value,  the  identification 
niamber  of  the  candidate  associated  with  the  maximxim 
MOA  value.  Ccindidate  data  files  are  created  by  the 
program  BUILD^CAND.FOR,  and  target  data  files  are 
created  by  the  program  BUILD^TARG.FOR. 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

★ 

* 

★ 


iticicit  ifk**ieii***it  *it  icitif  it  ******************  it  it  it 


implicit  none 

integer*4  no_cand,  no_targ,  i,  j,  dximmy 

integer*4  maxmoa^index (10000) 

character*16  cand  file,  targ_file,  out^file 

real*4  xt_coord(10070) ,  yt_coord (10000) 
real*4  xt__std (10000) ,  yt_atd(10000) 
real*4  xc_coord(10000) ,  yc__coord(10000) 
real*4  xc__std (10000) ,  yc_std(10000) 
real*4  moa (10000),  maxmoa (10000) 
real*4  tl,  t2,  deltal,  delta2 


^k**** ***************************************************************  ********* 

Input  data  file  names  from  the  user.  * 

k^**************** **************************************************  ********* 


* 

write (5, 1) 

1  format ('  Input  name  of  candidate  input  file') 
read(5,6)  cand_file 

write (5,2) 

2  format ('  Input  name  of  target  input  file') 
read(5,6)  targ_file 

write (5, 3) 

3  format ('  Input  name  of  association  output  file  (to  be  created)') 
read(5,6)  out_file 

£  foim]at(a) 

****************************************************************************** 

*  Start  the  "read/compute/write"  timer  * 

*******************  ********************************************************** 


* 


t2  =  secnds(O.O) 

* 

***************************************************************************** 

*  Open  the  input  data  files  * 

*  Note:  The  file  names  are  chosen  by  the  user  * 

*i<*1i*****ii***ic***1i*ic*********ii***-k********ii*****miic*******-k*i,i,i,i,-ki,i,ici,-k*iiic1ti,** 

* 

open(unit=9,  file=cand_file,  status=' old' ) 
open (unit=10, f ile=targ_f iJ e,  status=' old' ) 

* 

************************************************************************** *** 
*  Read  in  the  number  of  candidates  and  targets  * 
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ilc*************ir************ik*********1t*1c****it****-k-k*ic*****iiit**it::T.x-)c*itit******* 

read (9,  *) 
read (9,  *) 

read ( 9 ,  9  9 )  no_cand 
read  (9,*) 
read (9,*) 
readdO,  *) 
read{10, *) 
read  (10, 99)no_targ 
read (10,  *) 
read (10,  *) 

33  format (lOx,  i5) 

*  Read  in  the  x,y  coordinates  and  stamdard  deviations  from  the  * 

*  candidate  and  target  data  files.  * 

4k '*«******-A  ***************  *****«r  ************************************  ********** 
■* 

do  i=l ,  no_Ccind 

read(9,*)  diimmy,xc_coord(i)  ,yc_coord(i)  ,KC__std(i)  ,yc_std(i) 
end  do 

do  j=l,  no_targ 

read  ( 1 0 ,  * )  dummy ,  xt_coord  ( j ) ,  y t_coord  ( j ) ,  xt_st d  ( j ) ,  yt_std  ( j ) 
end  do 
close (9) 
close (10) 

■* 

***************************************************************************** 
'*  Start  the  computation  timer  * 

**************  ;i******** ****************************** ************************ 
-* 

tl=secnds(0.0) 

* 

4Nk*************************************************************************** 

■*  Start  the  MOA  calculation  * 

A  A  A:  A  1ft  A  *  A  A  A  *  *  A  A A  A  A  A  A  A  *  *  A  A  ifc  4:  A  A  A  A  A  A  4r  A  *  *  *  A  A  A  A  A  A  *  A  4lrik: ‘A  *  *  4^  1^  A  A A  A  A  A  A  *  A  4:  A  A A  *  *  *  * 
A 

do  j=l,  no_targ 
do  i=l,  no_cand 

moa (i) =exp (-0 . 5* ( ( (xt_coord ( j) - 

5  xc_coord(i) ) **2) * 

6  (1 . / (xt_std ( j) +xc_std (i) ) ) + 

&  ( (yt_coord( j) -yc_coord(i) ) **2)  * 

&  (l./(yt_std(j)+yc_std(i))))) 

end  do 

maxmoa ( j) =0 . 0 
maxmoa_index ( j )  =0 
do  i=l,no_cand 

if  (moa (i) .gt. maxmoa (j) )  then 
maxmoa ( j )  =moa ( i ) 
maxmoa_index ( j ) =i 
end  if 
end  do 
end  do 

* 

AAA  AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

*  End  of  the  MOA  calculation  * 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

A 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

*  stop  computation  timer  * 
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■*)**************************************************************************** 
:*  Open  the  output  file  and  write  out  the  results.  * 

tk******************************************* ********************************* 

■ 

open (unit=ll,  file=out_file,  status=' new' ) 

write (11, 77)  out_file 
T7  format ('  File  name;  a) 

write (11, 78) 

Tfl  format ('  This  data  file  was  created  by  moa.for'> 

write (11, 79)  cand_file,  targ_file 

79  format ('  Candidate  input  file:  ',  a,  'Targ.  input  file:  ',a) 

write (11, 799)  no_cand,  no_targ 

■72^9  format('  No.  of  candidates;  ',i5,llx,'  No.  of  targets:  ',i5) 
write (11, 80)  deltal 

SB  format ('  Time  to  compute  all  of  the  maximum  MOAs:  ',  flO.5) 

write (11, *) 
write (11,81) 

format ('  Targ.  No.  Max.  MOA  Max  MOA  Cand.  No.') 
do  j=l,  no_targ 

write (11, 111)  j,  maxmoa(j),  maxmoa_index ( j ) 
end  do 

ZUl  format  (is,  9x,  f7.5,  8x,  iS) 

***************************************************************************** 
=*  atop  "read/compute/write"  timer 

rk**************************************************************************** 

delta2=secnds (t2) 

write (11, *) 

write (11, 82)  delta2 

B2  format ('  Time  for  reading,  computing,  and  writing:  ',fl0.5) 

write (5,*)  deltal,  delta2 

rk 

close (11) 

** 

stop 

end 
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**********'^****»:*********************************************** 


File  name:  moa22_780.dat 

This  data  file  was  created  by  moa.for 

Candidate  input  file:  c2.dat  Targ.  input  file:  t2.dat 

No.  of  candidates:  100  No.  of  targets:  100 

Time  to  compute  all  of  the  maximum  MOAs:  1.65234 


Targ .  No . 

Max .  MOA  Max 

MOA  Cand.  No. 

1 

0.69514 

64 

2 

0.96774 

91 

3 

0.86108 

35 

98 

0.95123 

59 

99 

0.10026 

14 

100 

0.73544 

89 

Time  for  reading,  computing. 

and  writing: 

(r****************************  **************: 

File  name: 

moa23  780.dat 

This  data 

file  was  created  by  moa.for 

Candidate 

input  file:  c2.dat 

Targ 

No.  of 

candidates:  100 

No 

Time  to  compute  all  of  the  maximum  MOAs: 

Targ .  No . 

Max .  MOA  Max 

MOA  Cand .  No , 

1 

0.41403 

81 

2 

0.00127 

94 

3 

0.61967 

70 

998 

0.91219 

100 

999 

0.62772 

18 

1000 

0.84568 

15 

4.81250 


1000 


9.12891 


Time  for  reading,  computing,  and  writing;  18.22656 
*************************************************************** 


File  name:  moa24_780.dat 

This  data  file  was  created  by  moa.for 

Candidate  input  file;  c2.dat  Targ.  input  file:  t4.dat 

No.  of  candidates:  100  No.  of  targets:  10000 

Time  to  compute  all  of  the  maximum  MOAs:  83.86719 


Targ.  No.  Max.  MOA  Max  MOA  Cand. 


1  0.78472  81 

2  0.66918  97 

3  0.84246  55 

9998  0.37935  48 

9999  0.78605  28 

10000  0.96162  17 


No. 


Time  for  reading,  computing,  and  writing;  158.89063 
*************************************************************** 
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****)»'*******«'1.**«**««*««*«***********ilEikik***Kt*«««««i****ik*****««« 


File  naae:  waa32_y6Q .dst. 

This  data  file  was  created  Tsv  acoa.for 

Candidate  insst  File;  c3.dat  Tazg.  issct  file:  t2.dat 

Ho.  of  candidates:  lOQO  Ho.  of  targets:  IQO 

Time  to  cognate  all  of  the  scazinsm  590As:  S. 52734 


Targ.  Ho. 

Max.  MOA 

Vzx.  SfiQA  C 

1 

0.95123 

715 

2 

0.98347 

907 

3 

0.89434 

593 

98 

0.95919 

787 

99 

0.71405 

360 

100 

0.86688 

957 

Ho. 


Time  for  reading,  cogpnticg,  and  writing:  13.91015 


***£>******«****£*****««**«*«***************«*****«************ 


File  naae:  EK)a33_780.dat 

This  data  file  was  created  hy  noa.for 

Candidate  iigmt  file:  c3.dat  Targ.  inpid:  file:  t3.dat 

Ho.  of  candidates :  1000  Ho.  of  targets:  1000 

Time  to  confute  all  of  the  garimra  HQ&s:  86.06984 


Targ,  No. 

Max.  MOA 

1 

0.93359 

2 

0.78600 

3 

0.95123 

998 

0.95230 

999 

0,91952 

1000 

0.92609 

Max  MOA  Cand.  Ho. 
489 
637 
999 

375 

978 

296 


Time  for  reading,  cong)uting,  and  writing:  97.34766 

A************************************************************** 


File  name:  moa34_780.dat 

This  data  file  was  created  by  moa.for 

Candidate  input  file:  c3.dat  Targ.  input  file:  t4.dat 

No.  of  ceuididates:  1000  No.  of  targets:  lOCOO 

Time  to  compute  all  of  the  meuciroum  MOAs:  853.51953 


Targ,  No. 

Max,  MOA 

Max  MOA  Cl 

1 

0.96882 

385 

2- 

0.89593 

678 

3 

0.92169 

6 

9998 

0.96813 

870 

9999 

0.90484 

319 

10000 

0.91034 

19 

Time  for  reading,  computing,  and  writing:  924.83203 

•k-kick'k'kick-kifkiiifkitifk’kifk-k-k'k’kic'k-k’k-kick-kifick'kieic'k'kititit'k-k'klt'k^icickic'k'k'kiclfk-k-k'kic 
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***)k***«1K*********«**#iK***««««i»**«««««!li«**)K**)k**«««*«***«*)k**«* 


File  name:  aio242_J780.<£2t: 

Ihis  data  file  was  created  ^ss  maa.£oT 

Caiadidate  ing^t  file:  c4.d3t  "S^g.  ixaut  file:  t2.dat 


SO.  of  candidates:  10000  Sc.  of  targets: 

gitne  to  ccarante  all  of  tte  maxignac  SSQ&s:  02.02734 

.  So. 


100 


Taro.  So. 

Kax.  HOA 

!£ax  am.  c 

i 

0.96653 

7865 

2 

0.98400 

7562 

3 

1.00000 

6209 

98 

0.97102 

4928 

99 

1.00000 

5451 

100 

1.00000 

4650 

Visas  for  reading,  ccaguting,  and  writing:  142.35938 

*»***************-t*******:*************************-it±********^** 


File  naae:  soa43_7S0.dat 

This  data  file  was  created  by  ooa.for 

Candidate  input  file:  c4.dat  ?arg.  input  file:  c3.dat 

No.  of  candidates:  10000  No.  of  targets:  1000 

Time  to  cozpute  all  of  the  taaxignna  MOAs:  974.14063 


•< 

Targ.  No. 

Max.  MOA 

Kax  MOA  a 

1 

0.97403 

4731 

- 

2 

1.00000 

2160 

' 

3 

1.00000 

7554 

998 

0-96722 

8505 

999 

0.96227 

6606 

1000 

1.00000 

4341 

Time  for  reading,  cosputing,  emd  writing:  1021.78906 
**************************.,************************************ 


i 


File  name:  moa44_780.dat 

This  data  file  was  created  by  moa.for 

Candidate  input  file:  c4.dat  Targ.  input  file:  t4.dat 

No.  of  cemdidates:  10000  No.  of  targets:  IQOOO 

Time  to  coirpute  all  of  the  maximum  HOAs:  8873.75000 


Targ.  No. 

Max.  HOA 

Max  MOA  Cl 

1 

0.98621 

7913 

2 

1.00000 

3057 

3 

1.00000 

1586 

9998 

1.00000 

4948 

9999 

0.97531 

5186 

10000 

1.00000 

1838 

Time  for  reading,  computing,  and  writing:  8979.98047 
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ABSTRACT 

Most  siraiegies  for  fault  management  are  effective  for  a  narrow  range  of  fault  classes.  In 
survivable  C3  systems,  a  wide  range  of  operating  environments  may  be  encountered  that 
require  different  strategies  to  be  used  at  different  times.  Tins  repon  presents  a  classification 
for  fault  management  strategies  that  can  be  used  to  define  the  most  appropriate  methods  for 
assuring  survivability  under  conditions  that  can  suddenly  and  drastically  change. 

Two  metrics  by  which  this  adaptivity  can  be  specified  and  evaluated  -  the  objective 
function  and  a  coiusistency  measure  —  are  defined.  These  metrics  can  be  used  tj  determine 
how  well  a  survivable  system  meets  its  retjuirements  under  the  current  operating  environment. 
Policies  are  discussed  that  group  strategies  by  the  tradeoffs  that  can  be  made  when  the  system 
is  unable  to  meet  its  requirements  for  objectives  or  consistency  due  to  failures,  as  well  as  by 
the  fault  classes  that  can  be  handled. 
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A  TAX05^0MY  FOR  ADAPTIVE  FAULT  MANAGEMENT 
IN  SURVIVABLE  C3  SYSTEMS 

Final  Report 

AFOSR  Summer  Faculty  Research  Program 
16  August  1991 

Rex  E.  Gantenbein,  Associate  Professor 
Department  of  Computer  Science 
University  of  Wyomins 
P.O.Box  3682 
Laramie  WY  82071 

INTRODUCTION 

In  ihe  contexi  of  computing  systems,  dependable  has  been  defined  as  a  qualitative  term 
that  characterizes  .s  system  that  can  be  justifiably  trusted  to  deliver  the  required  service  when 
needed  !AvizicuisS9|.  Such  systems  have  become  important  of  late,  due  to  the  increased 
dependency  of  c  imnieaial  and  military  operations  on  complex  computations  that  cannot  be 
implemented  as  simple,  easily  testable  sequential  programs.  As  these  systems  are  used  in 
more  complex  ein  iromnents,  the  range  of  circumstances  that  are  encountered  in  the  course  of 
their  operation  iii..rea.scs-  A  complex  system  intended  to  be  dependable  must  be  designed  to 
meet  and  liaiuile  .i  uuie  \arieiy  of  potential  problems  in  the  course  of  its  operation. 

Unforiiin..ic!>.  most  strategies  intended  to  provide  dependability  consider  only  a  few  very 
specific  i.ssues.  IZscn  the  notion  of  dependability  itself  is  typically  divided  into  subsections, 
such  as  reliabiliiN,  availability,  performance,  etc.,  each  of  which  is  addressed  by  a  number  of 
methods  th.a  tlaini  lo  iiKica.se  the  .system’s  ability  to  withstand  perturbations  in  its  operating 
environment.  'I'l  esc  inctluxls  provide  increases  in  the  .system’s  resilience  to  particular  types 
of  problems,  l  i.:  .!k  .  uniiptions  that  are  made  about  the  environment  (often  to  make  the  pro¬ 
posed  solution  ti.ic tabic  ui  efficient)  limit  the  effectiveness  of  the  method  to  exactly  those  cir- 
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cumsiaiKres  specified  in  the  assumpiions.  As  long  ns  die  enviionmenial  nssuinpiions  arc  noi 
violaied,  a  dependable  sysfem  should  be  able  lo  achieve  its  specified  goals,  even  in  the  f.  ce  of 
definable  failuies  such  as  processor  or  communicafion  losses. 

The  result  of  this  optimization  for  a  panicular  operating  environment  is  that  most 
dependable  systems  tend  to  be  rigid  and  brittle.  Any  violation  of  a  system’s  assumptions  may 
cause  it  to  behave  in  w'ays  that  are,  at  best,  suboptimal  and,  at  worst,  unexpe'  ied.  .X’ormally, 
the  system  designer  attempts  to  specify  ti  e  expected  environment  as  pan  of  the  requirements 
definition,  which  sets  out  the  expectations  for  the  system  in  terms  of  things  such  as  func¬ 
tionality,  performance,  and  precision  of  produced  results.  Since  specification  is  still  an  imper¬ 
fect  craft,  complete  description  of  an  environment  is  difficult.  .Many  assumptions  about  the 
operating  environment  of  a  system  are  made  implicitly  by  the  designer  and  implementor. 

Since  the  same  strategy  that  works  under  one  set  of  assumpiions  about  the  operating 
environment  may  not  work  under  a  different  set,  what  was  an  effective  strategy  in  one  case 
may  be  ineffective  and  even  harmful  under  another.  A  rapid  and  drastic  change  in  the 
environment,  such  as  is  sometimes  seen  in  field-deployed  C3  systems  under  combat  condi¬ 
tions,  can  cause  catastrophic  failure  in  the  system  if  the  highly  adverse  environment  is  not 
included  in  the  design  requirements.  On  the  other  hand,  a  system  that  is  capable  of  surviving 
in  a  combat  environment  may  not  be  effective  or  efficient  in  "normal"  (noncombat)  usage. 
Even  the  mission  of  a  C3  system  can  change  over  time  (from  strategic  planning  to  theater 
operation  to  tactical  battle  management,  for  example),  and  strategies  appropriate  for  one  mis¬ 
sion  may  not  be  for  another. 

There  are  numerous  factors  affecting  fault  management  that  can  suddenly  change  For 
example,  unc  factor  common  to  many  approaches  is  the  ability  to  replace  system  components 
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ihai  have  maifunciioned.  Obviously,  any  s>>iem  in  ivhich  a  conr.poneni  cannoi  be  replaced  is 
more  susceptible  to  additional  malfunctions  than  one  in  which  replacement  is  possible.  Since 
some  strategies  for  avoiding  failure  are  more  effective  for  single-component  malfunctions  than 
for  multiple  malfunctions,  the  designer  must  choose  a  strategy  appropriate  for  what  he  or  she 
believes  to  be  the  most  likely  (or  perhaps  the  most  dangerous)  circumstances. 

It  is  easy  to  see  that  in  this  case  the  same  system  can  encounter  widely  different  cir¬ 
cumstances  during  its  operation.  The  system  may  at  any  time  find  that  replacements  have  sud¬ 
denly  become  unavailable.  One  could  also  imagine  that  the  system  could  be  deployed  in  a 
mode  that  precluded  the  replacement  of  peripherals  except  during  times  of  scheduled  mainte¬ 
nance.  Under  these  conditions,  failure  of  a  component  would  be  very  detrimental  to  the 
potential  for  continued  operation  of  the  system.  In  fact,  it  might  be  useful  to  consider  replac¬ 
ing  components  before  they  fail,  rather  than  after,  so  that  their  loss  is  less  likely.  Predicting 
the  likelihood  of  failure  in  a  component  is  a  difficult  task,  but  it  may  be  necessary  if  the  sys¬ 
tem  cannot  be  taken  down  for  an  extended  period  of  time,  as  when  an  threat  is  imminent,  for 
example. 

Clearly,  what  this  system  needs  is  to  be  adaptive,  that  is,  able  to  operate  under  different 
sets  of  environmental  conditions  without  violating  its  requirement.s,  which  may  also  vary  with 
the  conditions.  In  a  survivablc  system,  the  primary  requirement  is  to  avoid  catastrophic 
failure,  even  if  it  means  relaxing  or  ignoring  requirements  that  cannot  be  met  under  adverse 
conditions.  This  behavior  is  usually  referred  to  as  graceful  degradation.  Since  the  transition 
between  one  environment  and  aiiotlier  may  occur  too  quickly  for  an  operator  to  detect,  or  may 
be  due  to  problems  within  the  system  itself  not  observable  by  an  operator  until  it  is  too  late, 
the  system  must  al.so  be  able  to  detect  (or  predict)  such  transitions  in  order  to  be  able  to  con- 


3-4 


trol  (or  prevent)  the  failures  that  can  accompany  them. 


ADAPTIVE  FAULT  .MANAGEMENT  IN  SURVIVABLE  SYSTEMS 

The  purpose  of  this  paper  is  to  present  a  categorization  of  strategies  for  fault  manage¬ 
ment  in  survivable  systems  that  defines  the  circumstances  under  which  strategies  may  be 
appropriately  applied.  Fault  management  is  the  application  of  techniques  in  a  computing  sys¬ 
tem  to  prevent  the  system  from  failing  to  deliver  its  required  service  due  to  faults  in  any  of  its 
components  (hardware  or  software).  We  use  the  term  "fault  management"  rather  than  "fault 
tolerance"  because  the  strategies  used  to  avoid  catastrophic  failure  include  prediction  and 
approximation  as  well  as  the  more  well-known  methods  such  as  masking  and  recovery. 
Moreover,  although  not  addressed  explicitly  in  this  paper,  it  is  a  commonly  held  belief  that 
the  achievement  of  highly  dependable  systems  requires  the  application  of  fault  management 
strategies  at  all  stages  of  a  system’s  life,  from  design  to  testing  to  operation  to  maintenance. 
The  categorization  to  be  presented  here  may  be  applied  equally  well  to  strategies  for  design 
and  maintenance  and  to  operational  strategies. 

We  address  these  strategies  in  the  context  of  distributed  C3  systems.  For  the  greatest 
likelihood  of  survival,  a  C3  system  must  be  distributed  over  several  processing  nodes,  geo¬ 
graphically  dispensed  and  loosely  coupled  by  communication  links.  We  are  panicularly 
interested  in  heterogenous  nodes  that  are  devoted  to  a  single  application,  but  capable  of  auto¬ 
nomously  operating  in  both  dedicated  and  general-purpose  modes.  These  processors  should 
support  recoiiiiguration  and  redistribution  of  the  system  components  in  case  failures  in  any 
of  the  three  bi.,,ic  clas.ses  of  s>stem  resources  [Kohler81|.  processing  (generating  or  transfomi- 
ing  information),  data  y., luring  information),  or  communication  (transmitting  information). 
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Failure  is  the  inability  of  a  system  to  deliver  a  required  service.  The  cause  of  a  failure 
is  a  malfunction  in  the  system  operation  that  results  in  an  incorrect  system  "state "  which 
renders  the  system  incapable  of  delivering  the  service.  This  incorrect  state  is  an  error.  The 
cause  of  an  error  is  a  fault  in  the  software  or  hardware  that  was  expected  to  produce  the  ser¬ 
vice.  This  hierarchical  definition  can  be  extended  for  components  so  that  we  can  say  a  failure 
in  a  component  becomes  a  fault  in  a  higher-level  component  that  depends  on  the  undelivered 
service. 

A  system  that  suppons  adaptivity  can  potentially  achieve  greater  resilience  to  failures 
under  (possibly  rapidly)  changing  environments  by  allowing  the  system  to  dynamically  change 
its  fault  management  strategy.  Predictive  strategies  are  those  that  stress  avoidance  of  errors, 
based  on  what  the  system  thinks  will  happen,  while  reactive  strategies  are  those  that  stress 
tolerance  of  errors  that  have  already  occurred  and  been  detected.  While  it  is  itrpossible  for  a 
system  to  control  its  environment  (or  external  state)  and  therefore  the  results  of  its  execution 
(or  internal  state),  it  is  possible  to  predict  or  react  to  changes  in  those  states  and  apply 
different  strategies  based  on  those  predictions  or  reactions. 

We  therefore  define  adaptive  fault  management  as  the  dynamic  selection  of  fault 
management  strategies  and  the  mechanisms  by  which  the  strategies  are  implemented,  based  on 
predictive  or  leactive  information  about  the  .system  state,  either  external  or  internal.  We  also 
define  a  general  model  of  adaptive  fault  management,  as  shown  in  Figure  1.  Given  a  library 
of  fault  management  strategies,  the  system  must  evaluate  the  current  external  state  (including 
operation  mode  and  mission)  and  the  current  internal  state  (including  error  rate,  fault  type, 
resources  available,  re.soiircc  utilization,  and  performance  metrics)  to  determine  the  current 
computing  environment  Using  a  ^et  of  rules  that  relate  the  environment  to  the  requirements 
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far  the  system,  an  adaptive  behavior  manager  selects  a  fault  management  strategy.  This  stra¬ 
tegy  is  used  in  scheduling  of  system  tasks  and  fault  management  mechanisms  and  in  deter¬ 
mining  the  parameters  for  the  execution  of  those  mechanisms. 


•  Change  Parameters 
of  FT  Mechanisms 

•  Schedule  Tasks 


Figure  1.  A  model  for  adaptive  fault  management. 

SPECIFYING  ADAPTIVITY  IN  SOFTWARE  SYSTEMS 

In  order  for  adaptivity  such  as  we  are  describing  here  to  be  effectively  used,  it  is  neces¬ 
sary  that  the  system  designer  be  able  to  specify  the  acceptable  system  behavior  in  a  given 
environment.  We  need  a  way  by  which  the  system  can  be  evaluated  within  an  environment  to 
see  whether  the  current  fault  management  strategy  is  still  effective  or  whether  it  needs  to  be 
changed.  In  this  .section,  we  define  two  metrics  that  can  be  used  to  define  a  system’s  require- 
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ments  for  adaptivity  and  to  evaluate  how  well  the  system  is  performing  in  its  current  environ¬ 
ment  with  respect  to  those  requirements. 

Objectl.e  functions 

We  propose  first  a  technique  for  specifying  the  requirements  of  a  computer  system,  its 
subsystems,  and  its  components  to  allow  the  efficient  evaluation  of  the  effects  of  the  environ¬ 
ment  on  the  system.  Note  that  we  are  not  attempting  to  provide  validation  of  the  system,  but 
only  an  efficient,  quantifiable  metric  to  measure  changes  in  the  environment  and  their  effect 
on  the  system’s  ability  to  meet  its  requirements.  This  objective  junction  is  a  partial 
specification  of  the  .system  that  uses  derived  (predictive)  values  or  observed  (reactive)  values 
to  represent  the  environment  and  computes  a  set  of  values  for  particular  attributes  that  can  be 
compared  to  a  set  of  desired  or  expected  values  that  represent  the  specification  for  those  attri¬ 
butes. 

The  purpose  of  this  function  is  to  capture  in  a  computable  definition  the  services  required 
of  the  system  and  the  environmental  factors  under  which  those  services  can  be  expected  to  be 
delivered.  The  function  must  be  able  to  express  all  objectives  that  affect  its  dependability, 
including  functionality,  the  extent  of  the  services  provided  by  the  system,  perjormance,  the 
time-related  attributes,  security,  a  measure  of  the  system’s  resilience  to  maliciously  (as 
opposed  to  inadvenently)  introduced  faults,  safety,  the  requirements  of  the  system  that  deal 
with  the  dangers  that  must  be  considered,  and  precision,  which  we  can  define  as  a  measure  of 
the  latitude  that  will  be  accepted  in  the  system’s  meeting  its  specified  requirements. 

There  are  two  types  of  attributes  that  can  be  involved  in  such  a  function.  Factors  are 
attributes  that  are  inherent  to  the  system  or  component  being  described  by  the  function  and 
are  not  affected  by  changes  in  the  environment.  Tho.se  attributes  that  can  vary  in  respon.se  to 
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changes  in  the  environment  are  then  the  parameters  to  the  function.  Although  in  most  cases 
the  factors  in  an  objective  function  will  be  fixed  for  a  given  component  or  system,  we  prefer 
the  term  "factor"  to  "constant"  since  a  fixed  value  in  one  function  may  vary  in  a  different 
function,  and  since  the  value  of  a  factor  may  be  assumed  rather  than  known. 

Examples  of  objective  functions  can  be  found  in  many  forms,  both  practical  and  theoreti¬ 
cal.  A  frequently  encountered  example  of  an  objective  function  is  the  reliability  measure. 
This  kind  of  metric  usually  forms  the  basis  for  analysis  of  the  predicted  reliability  of  a  system 
or  component,  based  either  on  failure  data  observed  in  testing  or  on  values  predicted  from 
modeling.  Reliability  measures  can  be  combined  with  performance  measures  to  describe  an 
attribute  sometimes  called  performability.  Other  metrics  evaluate  requirements  in  terms  of 
availability,  security,  and  precision. 

Most  often,  dependability  measures  are  used  during  the  design  and  testing  of  a  system, 
not  its  operation.  Thus,  the  evaluation  metrics  are  usually  computed  manually  or  by  using 
automated  tools.  A  few  experimental  operating  systems  provide  implementations  of  a  partial 
objective  function  that  is  used  for  scheduling  and  resource  management  based  on  the  stated 
requirements  and  characteristics  of  the  components  of  a  system. 

The  objective  function  gives  us  a  model  for  determining  whether  a  system  is  behaving 
"acceptably"  with  regard  to  its  requirements.  We  can  compute  values  for  the  attributes  from 
the  function,  using  the  parameters  as  a  representation  of  the  internal  and  external  slates  of  the 
system  (i.e.,  of  the  current  environment).  If  these  computed  values  match  the  required  values 
for  the  attributes,  then  the  system  meets  its  objectives  under  the  current  environment. 

Furthermore,  this  model  allows  us  to  define  violations  of  the  requirements  and  gives  us  a 
method  of  detecting  when  they  occur.  If  the  computed  value  of  an  attribute  does  not  match 
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its  specified  value,  then  we  say  that  an  anomaly  in  that  attribute  has  occurred,  indicating  that 
the  requirement  for  that  attribute  has  not  been  met.  Anomalies  can  be  defined  independently 
for  all  attributes  specified  in  an  objective  function. 

Notice  that  many  types  of  errors  can  be  detected  using  this  model.  Performance 
anomalies,  such  as  missed  deadlines  or  timeouts,  are  simply  a  case  of  a  performance  attribute 
(time)  whose  value  exceeds  a  specified  threshold.  Functionality  anomalies,  like  a  data  item 
that  is  not  available,  can  be  expressed  as  a  value  of  zero  for  an  attribute  where  a  nonzero 
value  is  required.  The  advantage  of  our  model  is  that  we  can  consider  a  range  of  values  for 
any  attribute  in  an  objective  function  and  can  thus  specify  a  range  of  objective  values  that 
allow  for  a  tolerance  in  meeting  the  requirement.  This  tolerance  supports  adaptivity  by  asso¬ 
ciating  different  strategies  for  fault  management  with  different  (possibly  overlapping)  ranges 
of  attribute  values.  A  variety  of  policies  for  managing  adaptivity  in  a  system  can  be 
described  with  this  model  as  well,  such  as  maximizing  the  value  of  the  objective  function  or 
more  complex  policies  such  as  minimizing  the  distance  of  all  attribute  values  from  their 
specified  values  or  maximizing  the  likelihood  that  all  attributes  will  meet  their  specified  objec¬ 
tives. 

Consistency  measures 

A  second  consideration  in  specifying  adaptivity  in  a  survivable  distnbuted  system  is  the 
coordination  of  resources  involved  in  the  same  computation.  As  mentioned  earlier,  one  of  the 
reasons  that  distributed  computing  is  so  imponant  to  the  development  of  survivable  systems  is 
that  it  provides  a  basis  for  replication  of  the  processing,  data,  and  communication  resources  on 
which  providing  continued  service  in  the  presence  of  failures  depends.  However,  the  presence 
of  multiple  versions  or  copies  of  a  resource  implies  that  the  requirements  for  that  resource 
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apply  to  all  versions.  As  we  shall  see,  however,  an  adaptive  system  may  allow  replicated  ver¬ 
sions  of  a  resource  to  have  different  views  of  the  resource.  The  question  then  becomes  how 
to  specify  the  degree  of  similarity  in  the  individ’^al  views. 

To  measure  the  extent  of  agreement  or  "sameness"  of  a  view  of  a  resource  by  a  set  of 
computations,  we  can  apply  a  consistency  measure  to  each  common  attribute  of  the  objective 
functions  among  the  computations.  This  measure,  which  we  define  qualitatively,  provides  for 
agreement  among  the  views  of  a  resource  at  three  levels.  Nonmutual  consistency  describes 
the  situation  in  which  a  set  of  computations  have  completely  independent  views  of  a  resource, 
although  they  are  all  constrained  by  the  requirements  pertaining  to  it.  An  example  of  this  is 
concurrency  control  on  a  shared  database,  in  which  any  computation  may  access  the  database 
independently,  although  each  is  constrained  to  leave  it  in  a  state  that  is  consistent  with  some 
specified  invariant. 

Mutual  consistency  is  a  stronger  condition  in  that  there  is  implicit  in  its  definition  both 
replication  of  the  resource  and  some  relationship  among  the  computations’  views  of  the 
resource.  In  mutual  replicated  consistency,  all  views  of  a  resource  by  a  set  of  computations 
are  identical  and  consistent  with  each  other  at  all  times.  In  mutual  nunreplicated  consistency, 
the  views  are  not  necessarily  identical,  but  are  consistent  with  respect  to  each  other.  For  a 
database,  mutual  consistency  implies  that  all  computations  accessing  the  database  agree  on 
what  is  in  the  database  at  any  given  moment.  Mutual  replicated  consistency  requires  that  the 
views  be  exactly  the  same  as  well,  a  condition  that  can  be  achieved  using  serialized  transac¬ 
tions.  Mutual  nonreplicaied  consistency  requires  only  that  the  computations  agree  on  the  con¬ 
tents  of  the  database  without  necessarily  seeing  the  complete  database,  as  occurs  in  a  rela¬ 
tional  database  among  multiple  database  servers  responsible  for  m.aintaining  each  relation. 
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We  can  pvc  examples  of  linese  le.'  Js  cf  coemsEcmry  Ifor  Elite  psueessin^  seseomne  class  as 
wdL  Many  fault  nam§cmem  schemes  lely  on  a  leplkarted  gmcip  pmxsscs  ilias  coopcsaic 
to  provide  single  service.  For  this  service  to  be  conEinmnisIy  available,  tine  process  group 
must  have  the  same  \'iew  of  the  group  membership  at  the  same  time.  When  the  leader  of  a 
group  fails,  the  others  must  know  who  belmigs  to  the  group  ai  ..lat  time  in  order  to  agree  on  a 
new  leader.  This  is  therefore  an  e.\ample  of  mutually  replicated  conastency  in  processing. 

Mutual  nonreplicated  consistency  in  processes  is  seen  in  the  idea  of  a  group  of  distinct 
processes  communicating  among  themselves.  We  may  choose  to  use  process  rollback  and 
restart  in  the  case  of  a  process  failure.  In  this  situation,  however,  it  is  possible  that  the  roil- 
back  of  one  process  will  trigger  the  rollback  of  a  process  that  communicated  with  the  first  one 
in  Older  to  undo  the  effects  of  the  communication  between  them.  This  second  rollback  might 
trigger  the  rollback  of  a  third  process,  and  so  on,  until  the  entire  group  has  been  rolled  back 
to  some  point  far  beyond  the  point  of  failure  of  the  original  process.  Although  proposed  tech¬ 
niques  for  limiting  this  "domino  effect"  vary,  it  is  not  necessaiy  in  any  of  them  that  every 
process  know  about  every  other  process  in  the  conversation  in  order  to  avoid  the  effect 
Since  the  processes  are  related  to  the  conversation,  but  not  eveiy  process  sees  the  entire 
group,  such  a  scheme  maintains  consistency  among  the  processes  at  the  mutual,  but  nonrepli¬ 
cated,  level. 

Nonmutual  consistency  in  processes  is  found  in  situations  similar  to  that  of  the  data 
example,  only  with  rcsou'ccs  or  devices  rather  than  data  items.  In  most  operating  systems, 
scheduling  of  processes  for  execution  on  either  a  CPU  or  a  group  of  proces.sors  is  constrained 
by  the  requirement  that  processes  give  up  a  processor  when  waiting  tor  a  system  function, 
such  as  I/O,  to  be  completed.  The  processes  are  required  to  leave  the  processor  in  an  orderly 
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siaie  so  another  psoDcsis  caa  he  cse^^ 


Examines  of  the  vaivin*  levels  of  coassssenqi'  aie  also  seen  in  cosnmunicaiioa.  The  idea 
of  an  afonne  bfosdeast,  in  whidi  all  {socesses  invc4ved  must  see  the  same  message  or  else 
none  of  them  msnr  see  it,  is  constrained  to  be  coosisient  at  the  mutual  iqilicated  leveL  A 
mnitimedia  oommuiucation  such  as  simultaneous  voice  and  data  transmission  is  an  example  of 
mutual  nomeplicated  con^ency.  in  wiiidi  the  icceiving  processes  can  compare  their  views  to 
synchronize  the  transmisnons  and  assure  correct  reception.  Reliable  message  passing  over  a 
network  is  a  simple  example  of  a  tKMunutually  consistent  requirement,  in  that  all  communica' 
tion  over  the  netwevk  is  independent  but  mua  follow  a  transmission  protocol  to  avoid  interfer¬ 
ing  with  each  other. 

CHOOSING  A  STRATEGY  BASED  ON  REQUIREMENTS 

By  combining  a  measure  of  consistency  like  that  proposed  above  with  the  notion  of  the 
objeedve  function,  we  can  evaluate  how  well  a  group  of  computations  in  a  distributed  system 
is  able  to  maintain  the  serviceCs)  they  are  expected  to  provide.  Specifying  the  objective 
values  that  must  be  met  and  the  level  of  consistency  that  must  be  maintained  specifies  the 
bounds  for  both  individual  and  group  behavior  that  can  be  considered  "acceptable."  (Note 
that  a  single  process  providing  a  service  is  a  degenerate  case  of  nonmutual  consistency.) 

Of  course,  anomalies  in  the  achievement  of  an  objective,  such  as  functionality,  perfor¬ 
mance,  or  precision,  can  affect  the  system’s  ability  to  achieve  the  required  level  of  con¬ 
sistency,  and  vice  versa.  Fault  management  strategies  can  prevent  these  anomalies  either  from 
occurring  or  from  affecting  the  ability  of  the  system  to  meet  its  other  requirements.  What  we 
would  like  to  do  now  is  to  determine  which  strategies  can  be  applied  under  different  environ¬ 
ments  to  various  anomaly  clas.ses  so  that  we  can  maintain  acceptable  behavior  in  the  system. 
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We  would  also  like  so  be  able  lo  deieraiine  uiien  this  is  not  possible,  so  that  uc  may  consider 
chan^g  scaiedes  when  conditions  indicate  it  may  be  advantageous  to  do  so. 

Since  an  anomaly  can  be  either  detected  (x  predicted,  we  may  use  them  to  either  recover 
from  or  avoid  errors  and  potential  failures.  The  number  and  severity  of  the  anomalies  can 
guide  the  choice  of  a  strategy.  Mote  importantly,  howrever,  the  presence  of  anomalies  indi¬ 
cates  the  ways  in  which  the  system  is  not  behaving  acceptably.  It  may  be  that  the  require¬ 
ments  fm  this  behavior  can  be  met  if  other  requirements  are  relaxed  or  ignored.  In  other 
wmds,  we  may  choose  to  change  strategies  based  on  tradeoffs  among  the  system  require¬ 
ments,  deemphasizing  some  to  maintain  others. 

The  idea  of  trading  off  one  requirement  to  maintain  another  is  not  new.  The  most  typi¬ 
cal  tradeoff  is  functionality  against  performance.  Some  systems  maintain  full  functionality  of 
tasks  when  processing  or  communication  anomalies  occur,  either  by  providing  full  redundancy 
of  all  the  tasks  at  all  nodes  or  by  dynamically  modifying  the  scheduling  to  continue  the  task 
processing  at  another,  reachable  node.  This  kind  of  scheme  tends  to  reduce  the  performance 
capabilities  of  a  system  for  a  given  number  of  proces.sors,  especially  in  the  second  case  if  the 
processors  were  already  being  highly  utilized.  Other  .systems  reduce  functionality  by  dropping 
less  "important"  tasks  as  the  ability  to  meet  performance  goals  is  reduced.  This  scheme  is 
typically  found  in  real-time  systems,  where  certain  tasks  identified  as  critical  must  have  every 
attempt  made  to  schedule  them  by  their  deadline,  even  if  other  tasks  must  be  delayed  or 
deleted. 

In  many  types  of  dependable  systems,  such  tradeoffs  between  objective  requirements  are 
sufficient  to  provide  the  required  level  of  .service  under  a  narrowly  defined,  highly  predictable 
range  of  adversity.  In  survivable  systems,  however,  it  is  necessary  to  consider  the  possibility 
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wide^iead,  clustered  failures  that  cannot  be  compensated  for  by  such  methods.  If  giving 
objectives  is  insufficient  to  allow  a  system  to  survive  such  loss,  then  we  must  consider  giv¬ 
ing  up  consistency  instead.  This  approach  can  be  found  in  distributed  systems  that  handle 
pattidphing  of  the  system  by  supporting  independent  computations  that  may  later  be  recon¬ 
ciled  to  restore  consistency  among  them. 

We  therefore  present  another  categorization  of  fault  management  strategies  based  on  this 
notion  of  relaxing  or  trading  off  requirements  in  exchange  for  continued  survival  of  the  sys¬ 
tem.  We  define  three  policies  under  which  strategies  may  be  applied:  opiimisiic,  for  environ¬ 
ments  in  which  so  few  anomalies  are  expected  that  they  can  be  explicitly  detected  and  han¬ 
dled  without  long-term  effects  on  other  requirements  of  the  system;  pessimistic,  for  environ¬ 
ments  in  which  a  sufficient  number  or  frequency  of  anomalies  are  expected  as  to  require  the 
relaxing  of  one  or  more  of  the  objective  requirements  of  the  system  to  avoid  their  causing 
other  anomalies  and  possibly  catastrophic  failure;  and  ultrapessimisiic,  for  environments  in 
which  the  anomalies  that  have  occurred  require  the  relaxation  or  abandonment  of  consistency 
requirements  in  order  to  preserve  the  critical  services  of  the  system. 

Optimistic  strategies  depend  on  explicit  detection  of  and  recovery  from  anomalies  in  the 
system.  Their  usual  response  to  detection  of  an  anomaly  is  restoration  of  a  consistent  system 
state,  followed  either  by  an  attempt  to  repeal  the  operation  that  failed  or  by  reconfiguration  of 
the  system  to  include  an  alternative  version  or  spare.  This  is  usually  done  in  a  single- 
threaded  mode,  so  that  the  recovery  can  take  place  with  the  minimal  allocation  of  resources 
and  with  the  minimal  disruption  of  other  computations.  Since  consistency  among  a  group  of 
related  computations  must  be  preserved,  often  these  approaches  require  a  protocol  for  rollback 
of  multiple  processes. 
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Cleaiiy,  this  kind  of  strategy  is  sufficient  when  error  rates  are  low  and  there  are  sufficient 
resources  (including  time)  to  recover  from  anomalies.  However,  as  error  rates  increase,  or  the 
environment  becomes  such  that  recovery  would  affect  the  operation  of  the  system  too  greatly, 
optimistic  strategies  become  ineffective  or  too  costly.  Instead,  we  need  to  consider  using  pes¬ 
simistic  strategies  that  avoid  anomalies  by  relaxing  one  or  more  of  the  objective  requirements 
in  order  to  maintain  the  other  requirements,  including  consistency.  Typically,  these  strategies 
operate  in  a  multi-thread  fashion  and  avoid  failures  by  scheduling  or  masking  of  anomalies. 
This  kind  of  approach  will  handle  a  higher  error  rate,  but  tends  to  use  resources.  By  relaxing 
the  objective  requirements,  we  can  continue  to  operate  and  avoid  failures,  although  we  pay  the 
price  in  loss  of  functionality,  performance,  or  precision. 

We  may  also  choose  to  give  up  consistency  to  maintain  the  specified  objectives.  This  is 
not  a  widely  used  approach,  since  the  notion  of  consistency  is  seldom  seen  as  a  requirement 
that  can  be  specified  at  different  levels.  In  this  situation,  anomalies  can  be  masked  or  even 
ignored  with  respect  to  their  effect  on  consistency,  so  recovery  is  kept  to  a  minimum.  Degra¬ 
dation  of  service  may  be  required,  but  we  can  determine  which  services  to  eliminate,  if  we 
know  which  ones  are  related  to  maintaining  the  objectives  and  which  to  consistency.  It  would 
appear  that  this  kind  of  approach  would  tolerate  fairly  high  error  rates,  since  it  allows  the  sys¬ 
tem  to  ignore  certain  kinds  of  errors  and  requires  little  resource  overhead.  In  fact,  the  relaxa¬ 
tion  of  consistency  requirements,  coupled  with  the  degradation  of  service,  may  provide 
enough  processing  power  for  the  system  to  recover  some  of  its  objectives  (particularly  func¬ 
tionality)  on  its  own,  without  explicit  recovery  mechanisms. 

One  reason  that  iiltrapessimistic  approaches  have  not  been  studied  more  may  be  that  they 
are  hard  to  describe  in  non-application  specific  terms.  It  is  difficult  to  determine  the  most 
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caacdl  functions  of  an  application  and  where  consistency  may  be  safely  given  up  without  a 
dgnificant  amount  of  a  priori  information  about  the  application.  Less  information  of  this  son 
is  needed  for  pessimistic  strategies,  since  avoidance  techniques  can  often  make  use  of  model- 
ihg.  and  prediction  to  determine  the  parameters  of  the  objective  function.  Optinustic 
approaches  require  the  least  a  priori  information  of  all,  since  the  application  can  use  informa¬ 
tion  collected  from  execution  to  estimate  rather  than  predict  the  parameter  values. 

The  identification  of  these  three  categories  for  fault  management  strategies  allows  us  to 
consider  which  strategies  may  be  most  effective  when  the  surrounding  environment  rapidly 
and  drastically  changes.  For  example,  a  radar  system  may  operate  in  three  modes:  ready 
mode,  in  which  few  errors  are  expected  and  can  be  repaired  at  relative  leisure;  alert  mode,  in 
which  errors  must  be  avoided  due  to  the  need  to  maintain  the  system  in  full  operation;  and 
conflict  mode,  in  which  the  system  is  experiencing  high  losses  of  components  and  has  limited 
resource  and  replacement  capabilities.  We  could  choose  to  use  an  optimistic  strategy  in  the 
first  situation,  a  pessimistic  strategy  in  the  second,  and  an  ultrapessimistic  strategy  in  the  third. 

A  TAXONOMY  FOR  ADAPTIVE  FAULT  MANAGEMENT 

We  can  summarize  this  discussion  as  a  taxonomy  for  fault  management  strategies.  Pre¬ 
vious  taxonomies  related  to  this  problem  have  not  considered  adaptivity  as  an  issue,  concen¬ 
trating  instead  on  comparing  specific  hardware  or  software  architectures,  or  else  they  provide 
a  high-level  view  of  dependable  systems  that  is  of  little  use  in  the  detailed  design  and  analysis 
of  such  systems. 

A  taxonomy  that  considers  adaptivity  is  shown  in  Figure  2.  This  taxonomy  allows  us  to 
classify  strategies  for  fault  management  according  to  the  resource  classes  to  which  they  apply 
and  the  level  of  consistency  that  they  maintain.  Within  these  categories  we  can  define  both 
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tlie  .iwpiii^ies  that  can  indicate  that  an  objective  requirement  has  been  violated,  so  we  can 
i^t^  the  need  for  a  new  strategy,  and  the  strategies  that  may  be  applied  for  each  policy. 

R]^PURCE  CLASS  (Piocessing,  Data,  Communication) 


CONSISTENCY 

ERROR  DETECTION 

STRATEGIES 

Mutual  replicated 

Functional  anomalies 
Performance  anomalies 
Precision  anomalies 

Optimistic 

Pessimistic 

Ultrapessimistic 

Mutual  nonreplicated 

Functional  anomalies 
Performance  anomalies 
Precision  anomalies 

Optimistic 

Pessimistic 

Ultrapessimistic 

Nonmutual 

Functional  anomalies 
Performance  anomalies 
Precision  anomalies 

Optimistic 

Pessimistic 

Ultrapessimistic 

Figure  2.  A  taxonomy  for  adaptive  fault  management  strategies. 

The  advantage  of  such  a  taxonomy  is  that  it  can  be  applied  to  a  particular  problem  to 
define  the  strategy  classes  that  are  best  for  a  given  set  of  circumstances.  A  design  paradigm 
for  survivable  systems  can  be  envisioned  that  uses  these  classifications,  first,  to  guide  how  the 
system  can  be  constnicted  to  support  adaptation  among  the  strategies  and,  second,  to  define 
the  classes  of  strategies  that  best  fit  all  the  environments  that  the  system  is  expected  to 
encounter  (even  if  the  likelihood  of  encountering  any  of  them  is  small). 

The  proposed  taxonomy  encompasses  many  of  the  well-known  fault  management  stra¬ 
tegies  for  both  software  and  hardware.  However,  most  of  these  strategies  fall  into  the 
optimistic  or  pessimistic  categories.  Very  few  strategies  exist  that  utilize  the  relaxation  of 
consistency  requirements  to  maintain  the  objectives.  In  survivable  systems  that  are  deployed 
in  potentially  harsh  environments,  such  strategies  may  prove  to  be  extremely  imponant. 
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Qindiision 

We  have  presented  a  number  of  categories  for  fault  management  strategies  that  can  be 
used  to  guide  their  selection  in  an  adaptive  survivable  system.  By  using  the  guidelines,  the 
^stem  designer  will  be  able  to  define  a  number  of  situations  under  which  different  strategies 
may  be  effectively  used,  thus  improving  the  chances  of  the  system  avoiding  catastrophic 
failure  brought  on  by  the  violation  of  its  assumptions. 

Clearly,  much  research  is  needed  in  this  area  to  make  adaptivity  a  tool  in  the  design  and 
operation  of  survivable  .systems.  First  and  foremost,  the  taxonomy  must  be  populated  with  the 
known  strategies  to  define  how  well  we  understand  the  various  categories.  It  appears,  based 
on  preliminary  work,  that  we  have  many  techniques  for  optimistic  and  pessimistic  fault 
management  in  the  areas  of  data  and  communication.  Processing  fault  management  has  made 
much  progress  in  the  area  of  optimistic  strategies,  but  less  so  in  the  area  of  pessimistic.  None 
of  these  areas  have  been  extensively  studied  with  respect  to  ultrapessimistic  straiegies. 

Practical  issues  also  need  to  be  considered.  The  major  problem  with  the  use  of  objective 
functions  is  their  definition  in  a  complete,  yet  operationally  tractable  manner.  We  need  to  be 
able  to  define  objective  functions  for  the  requirements  (at  least  those  related  to  fault  manage¬ 
ment)  in  such  a  way  that  they  can  be  evaluated.  We  must  also  consider  how  and  when  to 
make  the  transition  between  strategies,  which  will  involve  evaluating  the  tradeoffs  between 
objective  and  consistency  requirements.  It  is  not  yet  clear  if  this  can  be  done  other  than  on  an 
application-specific  basis,  although  general  principles  should  be  definable.  Similar  problems 
exist  for  the  consistency  measures. 

Other  considerations  include  how  to  choo.se  the  appropriate  strategy  from  among  those 
available.  We  will  need  to  find  an  efficient  rule  ba.se  and  evaluation  r  .hod  for  deciding 
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among  soategies  in  every  system. designed  in  this  way.  The  rules  for  making  such  decisions 
vidll  be  complex  arid  a  genet^,  of  at  least  automatable,  decider  will  be  complex  as  well. 
•Fufthefmore,  to  avoid  a  single  failure  point  in  the  decision  manager,  it  will  be  necessary  to 
reliably  distribute  this  function  over  the  nodes:  While  a  nuihber  of  operating  systems  claim  to 
support  reliable  distributed  cdmpudng,  the  evaluation  of  their  support  for  survivable  systems 
bas^  dh  the  approaches  described  here  will  require  significant  effort. 

We  haye  mentioned  that  it  may  be  posrible  to  base  a  design  paradigm  for  adaptive  sur- 
yivable  C3  systems  oti  this  taxonomy.  It  appears,  although  we  have  not  yet  verified,  that  the 
approaches  here  can  be  used  for  subsysteih  as  well  as  system  design  and  implementation. 
The  priniary  questibri.s  are  how  to  integrate  subsystems  back  into  the  larger  system  and  how  to 
control  adaptivity  among  multiple  concurrent  systems  in  the  same  application.  The  relation¬ 
ships  among  processing,  data,  and  communication  may  not  be  orthogonal,  and  the, 
spedficatidh  and  management  of  adaptivity  at  the  lower  levels  will  need  to  be  considered 
cd'efully  to  allow  the  interaction  among  the  subsystems  to  be  managed. 

Eventually,  we  hope  that  the  work  outlined  here  will  provide  a  basis  for  survivable  sys¬ 
tems  that  can  be  "engineered"  rather  than  created.  We  fully  agree  that "...  building  distributed 
fault-tolerant  systems  will  remain  an  art  in  the  foreseeable  future"  [Cristian91J,  but  we  are 
confident  that  study  such  as  this  will  help  in  defining  the  skills  needed  to  achieve  more  highly 
dependable  computing  systems  at  a  time  not  as  distant  as  we  might  otherwi.se  believe. 
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APPENDIX  1 


AN  ANNOTATED  BIBLIOGRAPHY  OF 
DEPENDABLE  DISTRIBUTED  COMPUTING 

Rex  E-  Gantehbein,  Associate  Professor 
Department  of  Computer  Science 
University  of  Wyoming 
P.O  Box  3682 
Laramie  WY  82071 

1.  OVERVIEW  OF  DEPENDABILITY  ISSUES 
Textbooks 

T.  -Anderso.i  (ed.),  Resilient  Computing  Systems,  Volume  I,  John  Wiley  and  Sons  (1985). 

An  excellent  overview  of  the  major  issues  of  dependability.  The  chapters,  each  written  by  an 
expert  in  his  or  her  field,  cover  ha’^dware,  software,  and  communication  reliability,  real-time 
and  distributed  systems,  safety,  renabllitv  measures,  and  case  studies  of  commercial  systems 
circa  1985.  I  don’t  know  wlial-Gver-rbecame  of  Volume  II. 

B.W.  Johiison,  Design  and  Analysis  of  Fault-Tolerant  Digital  Systems,  Addison-Wesley 
(1989). 

An  introduction  to  the  design  and  analysis  of  dependable  hardware.  The  emphasis  is  on  tech¬ 
niques  and  concepts,  and  the  author  claims  that  "no  previous  knowledge"  of  fault  tolerance  is 
needed.  One  chapter  deals  with  VLSI,  and  several  sample  designs  are  included  as  examples 
in  the  text. 

B.  Littlewbod  (ed.),  Software  Reliability:  Achievement  and  Assessment,  Blackwell  Scientific 
Publications  (1987). 

Another  collection  of  summaries  written  by  a  collection,  of  experts,  with  the  emphasis  on 
softw^e.  Mucli  of  the  book  deals  with  reliability  measurement,  but  other  issues  such  as  fault 
tolerance  and  software  safety  are  addressed  in  chapters  a^-  well. 

V.P.  Nelson  and  B.D.  Carroll  (eds.),  Tutorial:  Fault-Tderant  Computing,  IEEE  Comp.  Soc. 
Press  (1987). 

One  of  the  IEEE  tutorials.  This  is  a  collection  of  papers  on  a  wide  range  of  fault- tolerance 
subjects.  Although  slightly  dated  now  (the  papers  all  date  back  to  1986  or  earlier),  this 
tutorial  is  still  a  thorough  overview  of  the  pioneering  work  in  many  areas  of  dependability. 

D.K.  Pradhan  (ed.),  Fault-Tolerant  Computing:  Theory  and  Techniques,  Volume  1  and  II 

A  two-volume  set.  Volume  I  deals  primarily  with  circuit  level  techniques,  including  test  gen¬ 
eration,  designing  for  testability,  fault  simulation,  and  coding  theory  and  techniques.  Volume 
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II  deals  with  system-level  techniques,  including  architecture  (general  and  multiprocessor), 
diagnosis,  reliability  estimation  tools  and  techniques,  and  software  fault  tolerance.  1  prefer  the 
Anderson  book  as  an  introduction,  but  this  is  a  good  reference  for  more  esoteric  issues  and 
covers  the  circuit-level  issues  in  detaih 

Articles 

A.  Avizienis  and  J.-C.  Laprie,  Dependable  Computing:  From  Concepts  to  Design  Diversity, 
Proc.  of  the  IEEE  74,5  (May  1986),  629-638. 

An  invited  paper  in  a  special  issue  on  fault  tolerance  in  VLSI.  The  paper  defines  the  basic 
terms  of  fault  tolerance  and  provides  a  preliminary,  high-level  taxonomy  of  dependability. 
Several  approaches  to  the  problem  of  fault  tolerance  are  considered,  with  the  emphasis  placed, 
as  would  be  expected  from  Avizienis,  on  design  diversity  as  a  means  of  tolerating  design 
faults. 

F.  Cristian,  Understanding  Fault-Tolerant  Distributed  Systems,  Comm.  ACM  34,2  (February 
1991),  56-78. 

A  comprehensive  paper  containing  both  concepts  and  case  studies  in  fault-tolerant  distributed 
computing.  The  paper  presents  a  unified  discussion  of  the  problems  to  be  solved  at  the 
hardware  and  software  architecture  levels  as  well  as  approaches  to  their  solution  based  on 
both  avoidance  (masking)  and  tolerance  (recovery).  It  is  suggested  in  the  paper  that  the  key 
to  dependability  is  balancing  failure  detection,  recovery,  and  masking  redundancy  at  various 
levels  oif  system  abstraction,  so  that  the  provision  of  dependability  at  lower  levels  will  make 
the  higher  levels  easier  to  design. 

J.  Gray,  Why  Do  Computers  Stop  and  What  Can  Be  Done  About  It?,  Proc.  5th  Symp.  on 
Reliability  in  Dist.  Software  and  Database  Sys.  (July  1986),  IEEE  Comp.  Soc.  Press,  3-12. 

A  report  on  the  results  of  studies  on  failures  in  Tandem  systems,  for  whom  the  author  worked 
at  the  time.  This  paper  is  often  quoted  as  a  source  of  data  on  reliability  in  commercial  fault- 
tolerant  systems,  even  though  the  dut.'>.  was  collected  from  u.ser  problem  reports  and  as  such  is. 
Gray  admits,  not  very  precise  and  probably  underreported.  Among  the  conclusions  is  that 
administration  and  software  were  the  major  causes  of  failure  in  these  systems. 

J.G.  Kuhl  and  S.M.  Reddy,  Fault-Tolerance  Considerations  in  Large,  Multiple-Processor  Sys¬ 
tems,  Computer  19,3  (March  1986),  56-67. 

A  survey  of  methods  and  techniques  for  achieving  hardware  fault  tolerance  in  large,  multi¬ 
computer  systems,  focusing  on  distributed  control.  The  paper  gives  a  framework  for  hardware 
fault  tolerance  in  distributed  systems  and  discusses  replication  and  masking  techniques,  diag¬ 
nosis,  repair  and  recovery,  and  communication  issues. 

J.-C.  Laprie  et  al..  Definition  and  Analysis  of  Hardware-  and  Software-Fault-Tolerant  Archi¬ 
tectures,  Computer  23,7  (July  1990),  39-51, 

A  review  of  software  fault  tolerance  techniques,  including  recovery  blocks,  N-version  pro¬ 
gramming,  and  hybrid  schemes  referred  to  here  as  N  self-checking  programs.  Faults  in 
software  differ  significantly  from  those  in  hardware,  and  architectural  designs  that  attempt  to 
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h^dle  both  are  complex.  Some  highly  abstract  solutions  to  this  problem  are  presented  here, 
along  with  ah  evaluation  of  their  effectiveness  based  on  Markovian  models  of  system 
behavior. 

y.P:  -Nelson,.  Fault-Tolerant  Computing:  Fundamental  Concepts,  Computer  23,7  (July  1990), 
19-25. 

-A  concise  overview  of  fault  tolerance  concepts,  mechanisms,  and  strategies.  The  review  is 
h^dware-oriented,  although  many  of  the  concepts  can  be  applied  to  software  as  well.  This 
paper,  as  well  as  the  Laprie  paper  above,  are  part  of  a  special  issue  of  Computer  on  fault 
tolerance  that  covers. a  wide  range  of  topics. 

Case  studies  of  commercial  systems 

J.-P.  Banatre  et  al..  The  Design  and  Building  of  Enchere,  A  Distributed  Electronic  Marketing 
System,  Comm.  ACM  29,1  (January  1986),  19-29. 

A  description  of  a  French  system  that  supports  distributed  synchronization,  atomic  activities, 
and  a  commit-protocol  with  recovery  algorithms. 

P.A.  Bernstein,  Sequoia:  A  Fault-Tolerant  Tightly  Coupled  Multiprocessor  for  Transaction 
Processing,  21,2  (February  1988),  37-45. 

A  system  with  a  multicomputer  architecture  that  uses  hardware  fault  detection  and  software 
recovery  in  the  operating  system. 

D.  Gifford  and  A.  Spector,  The  Cirrus  Banking  Network,  Comm.  ACM  28,8  (August  1985), 
798-807. 

One  of  the  series  of  case  studies  published  by  Gifford  and  Spector  in  the  Communications 
during  this  period.  This  is  actually  an  interview  with  the  president  of  CIRRUS  Systems,  Inc., 
with  discussion  of  the  high  availability  features  of  the  wide-area  network  that  supports  the 
transaction  processing. 

J.N.  Gray  and  M.  Anderson,  Distributed  Computer  Systems:  Four  Case  Studies,  Proc.  of  the 
IEEE  75,5  (May  1987),  719-726. 

A  comparison  of  four  distributed  architectures  with  respect  to  fault  management.  The  com¬ 
parison  centers  around  decentralization  of  hardware,  control,  and  redundancy  management  in 
varying  degrees. 

E. S.  Harrison  and  E.J.  Schmitt,  The  Structure  of  System/88,  A  Fault-Tolerant  Computer,  IBM 
Sys.  Journal  26,3  (March  1987),  293-318. 

Describes  the  configurations  of  the  System/88  family  of  IBM  products,  which  are  based  on 
the  Stratus/32  system.  The  system  achieves  fault  tolerance  via  hardware  duplexing  coupled 
with  a  distributed  operating  system  that  replicates  resources  transparently  across  several  nodes 
or  a  network  of  multinode  systems. 
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DIP.  Siewibrek,,  Fault  Tolerance  in  Commercial  Computers,  Computer  23,1  (July  1990),  26- 
37:- 

An  survey  of  fault  tolerance  in  commercial  systems  as  of  1990.  The  systems  reviewed 
include  uniprocessor  systems  (VAX  8600  and  IBM  3090),  multicomputer  systems  (Tandem, 
Stratus,  and  V^Xft  3000),  arid  multiprocessor  systems  (Teradata  and  Sequoia).  The  systems 
are  evaluated  as  to  the  sources  of  the  errors  that  they  handle  and  the  approaches  used.  This 
pFuer  is  found  in  the  special  issue  of  Computer  mentioned  previously. 

J. J.  Wallace  and  W.W.  Barnes,  Designing  for  Ultrahigh  Availability:  The  UNIX  RTR  Operat¬ 
ing  System,  Computer  17,8  (August  1984),  31-39. 

A  study  of  the  UNIX  RTR  operating  system  solutions  to  the  problem  of  providing  extremely 
high  availability.  This  system  is  used  by  AT&T  in  support  of  its  ESS  telephone  switching 
systems,  whichr  are  famous  for  their  (expected)  downtime  on  the  order  of  a  few  minutes  per 
year,, including  maintenance. 

2.  DEPENDABLE  DISTRIBUTED  RESOURCE  MANAGEMENT 
Data 

P.A.  Beriistein  arid  N.  Goodman,  Concurrency  Control  in  Distributed  Database  Systems,  Com¬ 
puting  Surveys  13,2  (June  1981),  165-201. 

A  sumy  of  48  methods  for  concurrency  control  in  distributed  databases.  The  paper  primarily 
considers  mechanisms  and  techniques  for  correctly  synchronizing  concurrent  access  to  data, 
with  performance,  issues  given  only  secondary  consideration.  Both  centralized  and  distributed 
schemes  are  discussed.  Of  particular  interest  is  their  identification  of  "anomalies"  in  con¬ 
sistency  in  the  data  and  the  methods  by  which  these  can  be  eliminated  (serialization,  etc.). 

K. P.  Eswaran  et  al..  The  Notion  of  Consistency  and  Predicate  Locks  in  a  Data  Base  System, 
Comm.  ACM  19,11  (November  1976),  624-633. 

The  paper  thtit  really  started  it  all  with  respect  to  concurrency  control  on  shared  data.  Is  there 
a  graduate  student  in  computer  science/engirieering  who  hasn’t  read  this?  The  paper  presents 
the  idea  of  consistency  for  databases  and  how  it  can  be  preserved  using  predicate  locks. 
Although  many  other  schemes  have  improved  on  these  ideas,  this  is  still  wonh  the  effon  to 
read  for  background  in  the  area. 

H.  Garcia-Molina  and  R.K.  Abbott,  Reliable  Distributed  Database  Management,  Proc.  of  the 
IEEE  75,5  (May  1987),  601-620. 

Algorithms  and  techniques  for  achieving  reliability  in  dispersed  and/or  replicated  data.  Some 
methods  for  handling  partitioning  of  a  database  are  also  briefly  surveyed. 

M.  Herlihy,  Comparing  How  Atomicity  Mechanisms  Support  Replication,  Proc.  4th  ACM 
Symp.  on  Principles  of  Dist.  Computing  (August  1985),  ACM  Press,  102-110. 

Compares  the  constraints  on  replication  necessary  to  maximize  the  concurrency  in  a  distri¬ 
buted  database  system.  The  author  suggests  that  availability  of  replicated  data  should  be  the 
criterion  by  which  atomicity  mechanisms  are  evaluated.  The  main  result  presented  is  that 
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hybrid  schemes  Ehai  uiilize  bods  iockisig  zsd  limessaEi^  Esscse  camBtmcv  sSmt  licek' 
ing  alone. 

WJl  Kc^Ier^  A  Seiey^’  of  Technkjssss  for  Syttidmmzsmst  and  Recomy  m  f>e<DCEiEffa!B2icd 
Compuicr  Sysiems,  Campmng  Survey’s  13^  (June  19S1X  I49-1S3. 

A  comprehensive  survey  of  techniques  for  <michioni2ung  access  eo  shared  ohjecss  in  a  disEii- 
buied  system  and  recov^ering  Ehose  objects  after  failures.  The  errq^Basis  is  on  sofsu-are  siruc- 
niring  techniques  to  achieve  eiHH'  recovery.  Although  the  paper  is  sonseuim  daBed,  these 
techniques  have  been  e.\tensively  used. 

J.D.  Noe  and  A.  .Andreassian,  Effectiveness  of  Rqrlication  in  Distributed  Ownputer  Netwwks, 
Proc.  7th  Int.  Coiif.  on  Dist.  Computing  Sys.  (September  1987),  IEEE  Cwnp.  See.  Press, 
508-513. 

A  comparison  of  voting,  regeneration,  and  available  copies  schemes  for  replication  control. 
The  schemes  ate  evaluated  on  the  basis  of  storage  costs  for  replicated  copies.  The  result 
presented  is  that  all  these  methods  increase  the  availability  of  the  data,  but  little  gain  is  seen 
with  more  than  two  copies,  making  voting  (which  requires  at  least  three  copies)  less  prefer¬ 
able  than  the  other  schemes. 

F.  Pitelli  and  H.  Garcia-Molina,  Recovery  in  a  Triple  Modular  Redundant  Database  System, 
Proc.  7th  hit.  Conf.  on  Dist.  Compiuing  Sys.  (September  1987),  IEEE  Comp.  Soc.  Press, 
514-520. 

A  description  of  a  method  for  providing  a  replicated  database  that  can  survive  failures  at  one 
of  its  nodes.  Tlic  system  will  continue  processing  transactions  at  the  other  nodes,  while  the 
lost  node  upon  repair  will  take  a  snapshot  of  the  corrupted  parts  of  the  database  and  initiate  a 
"catchup"  procc.ss  to  get  back  into  synchronization  with  the  others.  The  motivation  for  this 
approach  comes  from  results  suggesting  that  pure  recovery  in  this  system  hinders  its  ability  to 
process  transactions  efficiently. 

R.  Strong  et  al..  Handshake  Protocols,  Proc.  7th  Int.  Conf.  on  Dist.  Computing  Sys.  (Sep¬ 
tember  1987),  IEEE  Comp.  Soc.  Press,  521-528. 

Presents  a  paradigm  and  techniques  for  replicated  data  management  called  synchronous  disiri 
buted  memory.  Tlie  empliasis  is  on  maintaining  consistency  in  the  data  by  detecting  and 
recovering  from  failures  rather  than  avoiding  it,  especially  in  the  case  of  transient  partitions  in 
a  network.  A  useful  set  of  protocols  is  described  that  provide  ways  to  reestablish  consistency 
in  the  data  once  it  has  been  lost. 

J.S.M.  Verhofstad,  Recovery  Techniques  for  Database  Systems,  Computing  Surveys  10,2  (June 
1978),  167-195. 

A  old,  but  comprehensive  for  the  time,  survey  of  methods  for  maintaining  consistency  and 
availability  in  databases.  Since  most  of  these  methods  are  still  in  use,  this  article  is  still  valu¬ 
able  despite  its  age. 

Communication 
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K.  Binma  aswl  T.  JosepiB,  IkfiaMe  CwiHitunicaiion  in  iht  Presence  of  Failures,  ACM  Trans, 
on  Ccmpuier  Sys.  5,1  (Feknasy  !987). 

Describes  Eechniques  for  abe  densn  and  veriSicaiion  of  a  dismbuied  communicaiicn  facility. 
The  medxids  suf^tosn  a  fannly  of  reliable  muiiicasi  protocols  that  lespea  a  variety  of  otdeiing 
consnaints  and  litns  have  ^‘atying  consistency  and  perf(»tnance  depending  on  the  degree  of 
ocdeiing  that  mam  be  maintained.  The  piotocols  assure  that  the  processes  in  a  group  will 
observe  consistent  orderings  of  events  affecting  the  group,  where  consistency  does  not  neces¬ 
sarily  imply  identity  or  simulianeby.  This  paper,  and  the  others  by  Birman,  describe  tech¬ 
niques  used  in  the  ISIS  distributed  operating  environment  developed  at  Cornell. 

KP.  Birman  and  T.A.  Joseph,  Exploiting  Replication^  Tech.  Report  TR  88-917,  Cornell  Univ. 
DepL  of  Computer  Science  (June  1988). 

A  description  of  communication  among  replicated  processes  in  the  ISIS  distributed  system. 
The  mechanisms  provided  in  ISIS  allow  varying  levels  of  "synchrony”  among  processes  that 
range  from  IcKksiep  to  virtual  ^^uchrony,  a  level  of  consistency  in  which  communication  is 
cmisidered  synchronized  with  respect  to  message  ordering  as  long  as  the  results  are  indistin¬ 
guishable  in  the  receiving  processes. 

Procesang 

K.P.  Birman  ei  al..  Implementing  Fault-Tolerant  Distributed  Objects,  Proc.  4th  Symp.  on  Reli¬ 
ability  in  Dist.  Software  and  Database  Sys.  (October  1984).  IEEE  Comp.  Soc.  Press. 

A  reference  for  replication  control  in  ISIS.  The  system  uses  an  available  copies  scheme  that 
removes  failed  nodc.s  from  the  group  and  reconfigures  them  back  in  once  they  have  recovered. 
The  scheme  uses  the  communication  protocols  de.scribed  in  the  other  papers  listed  above  to 
support  varv'ing  levels  of  consistency  among  the  group  membership. 

B.A.  Coan  and  G.  'Hionias,  Agreeing  on  a  Leader  in  Real-Time,  Proc.  Real-Time  Sys.  Symp., 
(December  1990),  IEEE  Comp.  Soc.  Press,  166-172. 

Protocols  for  selecting  the  leader  of  a  "replication  ring”  of  processes  in  real  time.  The 
method  is  intended  for  shared  memory  systems  where  a  group  of  processes  elect  a  leader  that 
provides  a  service  to  requestors,  although  the  service  may  be  handled  by  all  members  of  the 
ring.  If  the  leader  fails,  a  new  leader  can  be  elected  in  a  bounded  time  using  the  methods 
described  here. 

F.  Cristian,  Agreeing  on  Who  is  Present  and  Who  is  Ab.sent  in  a  Synchronous  Distributed 
System,  Proc.  ISili  Ini.  Symp.  on  Fault-Tolerant  Computing  (June  1988),  IEEE  Comp.  Soc. 
Press,  206-21 1. 

Methods  for  detecting  departures  and  joins  in  a  process  group  and  reliably  reaching  agreement 
on  membership  in  a  bounded  time.  The  paper  presents  two  protocols  for  this  problem,  one  of 
which  has  fast  detection  of  changes  but  high  me.ssage  traffic  overhead,  even  when  no  changes 
occur.  The  second  ijrotoeol  has  a  longer  delay  but  has  a  provable  limit  on  message  overhead 
in  the  absence  of  failures. 
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F.  Cdstian,  A  Probabilistic  Approach  to  Distributed  Clock  Synchronization,  Proc.  Real-Time 
Sys.  Symp.  (1989),  IEEE  Comp.  Soc.  Press,  288-296. 

A  description  of  an  approach  to  clock  synchronization  that  defines  the  probability  of  getting 
distributed  clocks  to  agree  within  a  defined  precision.  This  paper  is  interesting  because  it 
exemplifies  the  issues  of  tradeoffs  for  both  objectives  and  consistency  in  an  application.  The 
protocols  presented  can  tolerate  processor  and  communication  failure  and  detect  clock  failures, 
but  the  tradeoffs  between  message  traffic,  time  to  synchronize,  and  the  probability  of  success¬ 
ful  ^nchfonization  within  a  specified  time  are  such  that  the  same  strategy  may  have  very 
different  results  depending  on  the  environment. 

MJ.  Fischer  and  N.A.  Lynch,  A  Lower  Bound  for  the  Time  to  Assure  Interactive  Con¬ 
sistency,  Infonnaiion  Processing  Letters  14,4  (June  1982),  183-186. 

A  proof  of  the  property  that  for  n  processors,  of  which  at  most  m  are  faulty,  m+1  rounds  of 
communication  are  required  for  interactive  consistency,  as  defined  in  the  paper  by  Pease  et  al. 
below.  These  are  two  of  three  papers  listed  here  that  discuss  the  issues  of  consistency  among 
processes  in  the  conie.\i  of  the  Byzantine  generals  problem  (the  third,  by  Lampon  et  al.,  actu¬ 
ally  defines  the  problem). 

E.D.  Jensen,  C.D.  Locke,  and  H.  Tokuda,  A  Time-Driven  Scheduling  Model  for  Real-Time 
Operating,  Systems,  Proc.  Real-Time  Sys.  Symp.  (December  1985),  IEEE  Comp.  Soc.  Press, 
112-122. 

Describes  a  time-driven  model  for  process  scheduling  that  includes  priority  as  a  function  of 
time.  This  time-value  function  is  supponed  in  the  Alpha  distributed  operating  system  and,  as 
such,  is  one  e.xample  of  an  implementable  objective  funcrion.  The  paper  presents  the  results 
of  experiments  in  evaluating  the  effectiveness  of  several  scheduling  algorithms  using  this 
model,  including  the  he.st-effort  .scheduler  that  is  described  in  Locke’s  thesis  below  and  imple¬ 
mented  in  the  first  releases  of  Alpha. 

M.K.  Joseph  and  A.  Avizienis,  A  Fault  Tolerance  Approach  to  Computer  Viruses,  IEEE 
Symp.  on  Security  and  Privacy  (April  1988),  IEEE  Comp.  Sci.  Press,  52-58. 

Examples  of  using  fault  tolerance  as  an  approach  for  security  violations.  This  is  an  largely 
unexplored  area,  although  it  seems  feasible  if  one  views  security  as  the  resilience  to  anomalies 
in  the  system  due  to  maliciously  (rather  than  inadvertently)  introduced  faults.  Since  the 
anomalies  may  result  in  denial  of  services  or  corruption  of  .sensitive  information,  security 
requirements  can  in  principle  be  specified  in  the  .same  way  as  other  requirements  such  as 
functionality  and  performance. 

M.K.  Joseph,  Integrating  .Security  into  Current  Dependability  Concepts  and  Models,  LAAS- 
CNRS  Tech.  Report  89194,  Toulouse,  France  (June  1989). 

Qualitative  models  for  approaches  that  combine  dependability  and  .security  as  outlined  in  the 
reference  for  Joseph  and  Avizienis  above. 

L.  Lamport,  R.  Shostak,  and  M.  Pea.se,  The  Byzantine  Generals  Problem,  ACM.  Trans,  on 
Programming  Lang,  and  Sys.  4,3  (July  1982),  382-401. 
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Description  of  the  "Byzantine  generals"  problem,  which  is  essentially  the  problem  of  reaching 
agreement  among  a  group  of  processors  in  the  presence  of  faults  in  its  members.  The  paper 
proves  that,  if  oral  (that  is,  unreliable)  messages  are  used  for  communication,  withstanding  m 
failures  requires -a  minimum  of  3m+l  processors,  then  presents  a  solution  to  the  problem  that 
caii  be  implemented  with  the  minimum  number  of  processors.  A  second  solution  that  uses 
written  and  signed  messages  (analogous  to  authenticated  or  tagged  messages)  allows  a  reduc¬ 
tion  in  the  number  of  processors  needed  to  withstand  a  given^  number  of  faulty  processors. 
The  Byzantine  generals  problem  is  an  example  of  maintaining  mutual  replicated  consistency 
among  a  group  of  processors.. 

CD.  Locke,  Best-Effort  Decision  Making  for  Real-Time  Scheduling,  Ph.D.  Thesis,  Depanmerit 
of  Computer  Science,  Garnegie-Mellon  University  (May  1986). 

A  description  of  scheduling  algorithms  in  real-time  systems,  including  the  best-effort 
scheduler  used  in  Alpha.  This  provides  an  example  of  how  an  objective  function  (in  this 
case,  the  time-value  function)  can  be  u.sed  in  a  system  to  evaluate  the  ability  of  the  system  to 
meet  its  requirements  (in  this  case,  the  real-time  deadlines).  When  the  system  load  increases 
to  the  point  that  some  degradation  is  required,  the  best-effon  scheduler  uses  the  time-value 
functions  for  all  tasks  to  maximize  the  "value"  of  the  system  with  respect  to  the  task  schedule 
chokn,  rather  than  scheduling  strictly  on  priority  or  criticality  as  is  typically  done. 

S.R.  Mahaney  and  F.B.  .Schneider,  Inexact  Agreement:  Accuracy,  Precision,  and  Graceful 
Degradation,  Proc.  4tli  Ann.  ACM  Symp.  on  Principles  of  Dist.  Computing  (August  1985), 
ACM  Press,  237-249. 

Two  protocols  that  allow  processes  to  "agree"  on  a  value  without  requiring  that  the  value  be 
the  same  in  both.  The  protocols  let  the  processes  exchange  their  values  and  use  them  to  com¬ 
pute  new  ones  that  should  be  closer  to  both  the  actual  value  and  to  each  other’s.  As  long  as 
fewer  than  one-third  of  the  processes  are  faulty,  then  these  protocols  will  result  in  conver¬ 
gence  for  the  value  (if  not,  then  the  divergence  is  bounded).  This  is  yet  another  example  of 
levels  of  consistency. 

R.  Obenriarck,  Distributed  Deadlock  Detection  Algorithm,  ACM  Trans,  on  Database  Sys.  7,2 
(June  1982),  187-208. 

An  algorithm  for  detecting  deadlock  in  distributed  transactions.  Centralized  algorithms 
require  a  site  to  be  in  contact  with  all  other  users,  while  many  distributed  algorithms  replicate 
the  "wait-for"  graph  u.sed  to  detect  deadlock  at  all  sites.  This  scheme  splits  the  detection 
graph  among  the  sites,  providing  different  views  of  the  processing  at  each.  Local  deadlocks 
can  be  broken  locally,  and  only  global  deadlocks  involving  transactions  at  remote  sites  need 
to  be  coordinated.  This  algorithm  is  susceptible  to  false  alarms,  so  it  may  abort  some  transac¬ 
tions  needlessly.  An  example  of  mutual  non-replicated  consistency. 

M.  Pease,  R.  Shostak,  and  L.  Lampon,  Reaching  Agreement  in  the  Presence  of  Faults,  Jour¬ 
nal  of  the  ACM  27,2  (April  1980),  228-234. 

The  original  proof  that  algorithms  for  interactive  consistency  among  a  group  of  n  processors 
with  m  or  fewer  faulty  processors  can  only  be  devised  if  n  is  3m+l  or  greater.  The  definition 
of  interactive  consistency  given  here  is  that  all  nonfaulty  processors  must  come  to  a  consistent 


3-29 


view  prthe  values  held  by  all  processors,  including  the  faulty  ones.  Note  that  the  faulty  pro¬ 
cessors  afe.not  constrained  by  this  requirement,,  which  is  then  weaker  than  a  strict  (and  unat- 
;tainabie,  in  all  likelihobd). requirement  for  consistency  in  all  processors,  faulty  or  not. 

3.  DESIGI^ING  DEPENDABLE  DISTRIBUTED  SYSTEMS 

Design  paradigms 

A.  Avizicnis  icdi),  Application  of  Fault  Tolerance  Technology:  Design  of  Fault-Tolerant  Sys¬ 
tems,  BM/C3.  Algorithm  and  Processor,  Working  Group  Report,  Rome  Air  Development 
Center,  Griffiss  AFB,  New  Y.ork  (October  1989). 

A  report  prepared  for  RADC  based  on  the  work  performed  by  a  group  of  experts  in  1986- 
1987.  ^e  report  discusses  many  of  the  important  issues  in  the  application  of  fault  tolerance 
to  distributed  systems.  The  most  interesting  discussion  centers  around  a  design  paradigm  for 
the  development  of  dependable  systems,  the  essence  of  which  is  the  panitioning  of  a  system 
design  into  subsystems  for  which  the  error  detection  and  fault  management  techniques  are 
individually  devised  and  evaluated.  While  it  is  hard  to  cal}  this  a  radical  notion,  the  idea  of  a 
design  paradigm  that  supports  a  structured  method  for  the  development  of  dependable  systems 
is  imponant.  From  a  software  engineering  point  of  view,  it  is  impossible  to  develop  truly 
dependable  systems  unless  the  development  process  itself  is  dependable  and  evaluatable. 

F.G.F.  Davis  and  R.E.  Ganteribein,  Responding  to  Catastrophic  Errors:  A  Design  Technique 
for  Fault-Tolerant  Software,  Journal  of  Sys.  and  Software  (to  appear  1991). 

A  design  paradigm  for  systems  that  may  experience  catastrophic  errors.  The  intent  is  to  iden¬ 
tify  those  failures  that  cannot  be  handled  in  a  localized  manner  and  then  design  the  system  to 
avoid  their  occurrence. 

I.  Dunham,  Abstraction  and  Methodical  Development  of  Fault-Tolerant  Software,  Tech. 
Repon  CMU-CS-86-112,  Carnegie-Mellon  Univ.  (1986). 

A  Ph.D.  thesis  from  CMU  describing  a  structured'  approach  to  fault-tolerant  software  design. 
This  is  one  of  the  few  documented  attempts  to  use  software  engineering-like  techniques  for 
dependable  software.  Unfortunately,  the  only  place  I’ve  seen  it  mentioned  is  in  an  IBM 
University-Level  Computer  Science  course  on  fault-tolerant  software  that  1  teach. 

N.G.  Levcson,  Builciing  Safe  Software,  in  Software  Reliability:  Achievement  and  Assessment 
(B.  Littlewood,  ed.),  Blackwell  Scientific  Publishing,  Oxford,  1987. 

An  approach  to  designing  systems  with  the  emphasis  of  pre.serving  safety  as  an  attribute  as 
well  as  the  other  objectives  nomially  considered.  In  safety-critical  systems,  operational  failure 
can  result  in  damage  to  propeny,  personnel,  or  the  environment.  The  approach  described  here 
uses  risk  factors  to  determine  where  the  need  for  dependability  (either  in  fault  avoidance  or 
tolerance)  is  greatest.  While  intended  more  for  control  or  avionics  applications  than  surviv- 
able  systems,  this  approach  has  merit  in  its  emphasis  on  effective  application  of  fault  manage¬ 
ment  as  a  means  to  prevent  undesirable  behavior  instead  of  a  means  to  preserve  the  status 
quo. 

Hardware  designs 
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R.E,  Harper,  J.H.  Lala,  and  JJ.  Deyst,  Fault  Tolerant  Parallel  Processor  Architecture  Over¬ 
view,  Proc.  JStli  hit.  Symp:  on  Fault-Tolerant  Computing,  (June  1988),  IEEE  Comp.  Soc. 
Press,. 252-257; 

An  early  description  of  the  FTPP  from  Draper  Labs.  In  many  systems,  reliability  and 
throughput  are  seen  as  conflicting  requirements,  so  this  system  was  designed  using  clusters  of 
multiprocessors.  Of  special  interest  here  is  the  recognition  that  mutual  synchrony  is  difficult 
to  achieve  with- any  efficiency,  so  they  support  nonhomogenous /Mwcr/ona/  synchrony  to  coor¬ 
dinate  equivalent  execution  among  the  clusters. 

J.H.  Lala  and  L.S.  Alger,  Hardware  and  Software  Fault  Tolerance:  A  Unified  Architecture 
Approach,  Proc.  ISth  hit.  Symp.  on  Fault-Tolerant  Computing,  (June  1988),  IEEE  Comp.  Soc. 
Press,  .240-245. 

A  design  based  on  clusters  of  multiprocessors  that  tolerates  arbitrary  hardware  faults  and 
efficiently  implements  N-version  programming.  This  design  adds  an  application  processor  to 
each  multiprocessor  cluster  so  that  hardware  and  software  faults  can  be  differentiated  and  iso¬ 
lated.  Masking  is  carried  out  by  a  configuration  voter  that  carries  the  error  history  of  each 
version,  so  that  after  a  while  it  "learns"  which  versions  are  triisiwonhy  and  will  decide  on  that 
basis  rather  than  on  a  strict  plurality. 

J.H.  Lala,  R.E.  Harper,  and  L.S.  Alger,  A  Design  Approach  for  Ultrareliable  Real-Time  Sys¬ 
tems,  Computer  24,5  (May  1991),  12-22. 

A  description  of  approaches  to  masking  errors  and  achieving  congruency  among  redundant 
hardware  components.  The  context  of  the  discussion  is  the  design  of  architectures  that  can 
tolerate  a  single,  arbitrary  hardware  fault  as  well  as  some  double  simultaneous  faults  without 
complete  proces.sor  redundancy.  (Note:  I  admit  to  being  little  concerned  with  strictly 
hardware-based  techniques  for  fault  tolerance,  which  is  why  this  section  is  so  sparsely  popu¬ 
lated.  The  basic  issues  of  hardware  fault  tolerance  are  described  in  some  of  the  overview 
papers,  and  management  of  multiple  processors  is  included  under  processing  in  the  previous 
section.  However,  this  paper  is  a  demonstration  that  fail-stop  processors  do  indeed  exist,  mak¬ 
ing  higher-level  techniques  feasible.) 

Software  designs 

T.E.  Bihari  and  K.  Schwan,  A  Comparison  of  Four  Adaptation  Algorithms  for  Increasing  the 
Reliability  of  Real-'fimc  Software,  Proc.  Real-Time  Sys.  Symp.  (December  1988),  IEEE 
Comp.  Soc.  Press,  232-241. 

A  description  of  an  adaptive  fault-tolerance  manager  in  a  real-time  system.  The  manager 
incorporates  a  uniform  model  of  fault-tolerance  techniques  and  provides  tools  to  implement 
algorithms  for  choosing  and  performing  adaptation  in  real  time.  The  adaptations  include 
changes  in  the  .softwtire  configuration,  its  internal  structure  and  function,  or  the  software-to- 
hardware  mapping.  The  system  schedules  fault  tolerance  (which  may  consist  of  redundant 
software,  retry,  or  assignment)  to  favor  those  components  that  have  the  highest  value  of  prior¬ 
ity  times  number  of  failures  (this  is  the  objective  function).  Various  combinations  of  fault- 
tolerance  algoiithms  were  tried,  and  the  paper  summarizes  the  average  service  value  lost  per 
fault  in  each  ca.se.  .\lthuugh  the  number  of  adaptations  possible  is  small,  this  is  an  example 
of  what  adaptive  fault  mtinagement  can  achieve. 
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A.K;  Caglayan,  P.R.  Lorczak,  arid  D.E.  Eckhardt,  An  Experimental  Evaluation  of  Software 
Diversity  in  a  Fault-Tolerant  Avionics  Application,  Proc.  7th  Symp.  on  Reliable  Dist.  Sys. 
(October  1988),  IEEE  Coriip;  Soc.  Press,  63-70. 

Algorithm  diversity  as  applied  to  redundancy  management  software  for  a  fault-tolerant  sensor 
array.  The  study  shows  the  performance  gains  achieved  from  using  three  diverse  algorithms 
in- parallel  for  error  detection  and  isolation.  The  gain.s  were  mostly  due  to  the  elimination  of 
false  alarms.  The  paper  contains  a  useful  discussion  of  "median"  versus  "majority"  voting  to 
detect  and  mask  errors. 

W.R.  Dunn  and  L.D.  Corliss,  Software  Safety:  A  User’s  Practical  Perspective,  Proc.  Ann. 
Reliability  and  Maintainability  Symp.  (January  1990),  IEEE,  430-435. 

An  evaluation  of  software  projects  at  the  NASA  Ames  Research  Center.  Although  the  pro¬ 
jects  described  here  have  exhibited  high  reliability  in  practice,  the  authors  conclude  that  the 
design  processes  for  dependable  software  must  also  be  improved  if  any  kind  of  guarantee  is  to 
be  given.  The  examples  demonstrate  how  ultrareliability  was  achieved  by  fault  avoidance 
through  design  adapt:ition  (iri  other  words,  they  ran  the  system  to  see  how  it  failed  and  then 
adapted  the  design  or  the  implementation  to  prevent  that  failure  from  occurring  again). 

O.  Gudmundsson  et  al.,  MARUTI:  A  Hard  Real-Time  Operating  System,  Proc.  2nd  IEEE 
Workshop  on  Experimental  Dist.  Sys.  (October  1990),  IEEE  Comp.  Soc.  Press,  29-34. 

An  overview  of  a  distributed  operating  system  designed  at  Maryland  to  reliably  support  hard 
real-time  applications.  The  system  provides  fault-tolerant  operation  through  mechanisms  that 
can  be  used  to  implement  a  number  of  policies.  The  system,  which  has  been  prototyped  on 
top  of  UNIX,  provides  for  the  definition  of  objects  whose  description. in  the  system  contains  a 
specification  of  the  fault  tolerance  technique  u.sed.  It  isn’t  clear  from  the  paper,  however, 
whether  the  technique  used  for  an  object  is  dynamic  or  static. 

M.D.  Hansen,  Survey  of  Available  Software-Safety  Analysis  Techniques,  Proc.  Ann.  Reliabil¬ 
ity  and  Maintainability  .Symp.  (January  1989),  IEEE,  46-49. 

A  survey  of  techniques  for  conducting  software  safety  analysis.  These  techniques  are  obvi¬ 
ously  adapted  from  NSCCA  hardware  analysis  techniques  (what’s  the  point  of  calling  a 
software  design  review  "sneak  circuit  analysis?"),  but  make  the  point  that  structured  reviews 
of  code  can  eliminate  a  number  of  faults  before  they  become  operational.  The  paper  suggests 
that  Petri  net  modeling  and  simulation  may  be  promising  techniques  for  analyzing  (and 
perhaps  evaluating)  a  number  of  objectives,  including  performance,  correctness,  fault  toler¬ 
ance,  and  safety. 

E.D.  Jensen  and  J.D.  Northeutt,  Alpha:  A  Non-proprietary  OS  for  Large,  Complex,  Distri¬ 
buted  Real-Time  Systems,  Proc.  2nd  IEEE  Workshop  on  Experimental  Dist.  Sys.  (October 
1990),  IEEE  Comp.  Soc.  Press,  35-41. 

A  concise  and  easily  accessible  reference  for  the  Alpha  operating  system  developed  at 
Camegie-Mellon.  Alpha  has  many  properties  that  support  the  development  of  survivable  dis¬ 
tributed  systems,  including  the  provision  of  time-value  functions  which,  as  mentioned  in  the 
previous  section,  are  an  example  of  an  implementable  objective  function  for  performance, 
functionality,  and  precision  with  re.spect  to  scheduling. 
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B.  Littlewood  and  D.R.  Miller,  A  Conceptual  Model  of  Multi-Version  Software,  Proc.  17th 
Iht.  Symp.  on  Faiilt-Tokmnt  Computing  (June  1987)j  IEEE  Comp.  Soc.  Press,  150-155. 

A  study  of  how  diversity  of  development  methods,  affects  the  resilience  of  multi- version 
software  systems.  The  value  of  diverse  development  -is  shown  to  depend,  on  the  characteris¬ 
tics  of  the  system,  particularly  in  the  effects  of  similar  errors. 

T.P.  Ng,  The  Design  and  Implementation  of  a  Reliable  Distributed  Operaring  System  -  ROSE, 
Proc.  9th  IEEE  Symp.  on  Reliable  Dist.  Sys.  (1989),  IEEE  Comp.  Soc.  Press,  2-11. 

A  brief  description  of  an  experimental  distributed  operating  system  being  developed  at  Illi¬ 
nois.  The  system,  which  is  based  on  the  V  kernel  with  extensions,  provides  tasks  that  can 
detect  when  other  tasks  have  become  unreachable  and  an  abstraction  of  replicated  tasks  that 
can  be  utilized  by  applications.  Of  particular  interest  is  the  guarantee  function,  which  guaran¬ 
tees  that  a  task  will  be  in  a  state  consistent  with  having  sent  a  message  (unless  it  has  been  ter¬ 
minated  due  to  a  sufficient  number  of  failures  in  sending  the  message)  and  thus  supports 
mutual  consistency  among  the  tasks. 


K.  Schwan,  A.  Gheith,  and  H.  Zhou,  From  CHAOSba.se  to  CHAOSarc;  A  Family  of  Real- 
Time  Kernels,  Proc.  Real-Time  Sys.  Symp.  (December  1990),  IEEE  Comp.  Soc.  Press,  82-91. 

Presents  a  family  of  object-based  operating  system  kernels  that  are  extensible  and  customiz¬ 
able  to  a  variety  of  real-time  environments.  The  major  feature  of  this  family  of  CHAOS  ker¬ 
nels  is  the  support  of  object  abstractions  that  can  be  adaptively  specified  with  respect  to  a 
number  of  attributes.  The  external  interface  to  any  CHAOS  object  is  a  policy  abstraction  that 
interprets  the  current  specification  and  invokes  the  object  through  one  of  its  multiple  entry 
points.  While  not  directly  related  to  fault  management,  this  approach  may  serve  as  a  method 
for  implementing  tidaptivity. 

4.  RELIABILITY  M()1)ELIN(; 

Overview  and  examples 

V.R.  Basil!  and  B.T.  Pcrricone,  Software  Errors  and  Complexity:  An  Empirical  Investigation, 
Comm.  ACM  21, \  (January  1984),  42-52. 

An  analysis  of  the  di.stributions  and  relationships  derived  from  change  data  collected  during 
the  development  of  several  sy.stems  between  August  1977  and  May  1980.  The  information 
collected  includes  changes  to  the  system  and  repair  of  detected  errors.  The  major  findings 
were  that,  first,  the  majority  of  detected  errors  were  due  to  misunderstanding  of  the 
specifications  (!)  anil,  second,  that  the  size  of  a  module  was  not  correlated  to  its  proneness  to 
errors. 

V.R.  Basil!,  Recent  Advances  in  Software  Measurement,  Proc.  12th  Int.  Cunf.  on  Software 
Eng.  (March  1990),  IEEE  Comp.  Soc.  Press,  44-49. 

An  abstract  of  a  talk  given  at  the  conference.  The  paper  doesn’t  have  much  to  say,  but  there 
is  an  extensive  bibliography  on  software  reliability  measurement  included. 
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M;W;  Bush,  Getting  Started  on  Metrics  --  Jet  Propulsion  Laboratory  Productivity  and  Quality, 
Pwc.  12tli'lnt.  Conf.  on  Software  Eng.  (March  1990),  IEEE  Comp.  Soc.  Press,  133-142. 

A  rejpoit  on  JPL’s  experience  with  collecting  and  analyzing  data  on  the  software  development 
process.  The  collection  process  was  part  of  a  quality  improvement  effort  and  was  intended  to 
provide  baselines  for  quality  and  productivity  measures.  Included  in  the  paper  are  some 
figures  oh  defects  per  lines  of  code  in  systems  developed  by  JPL,  IBM,  and  RADG. 

K.C:  Ferrara,  S.J.  Keene,  and  C.  Lane,  Software  Reliability  from  a  System  Perspective,  Proc. 
Am.  Reliability  and  Maintainability  Symp.  (January  1989),  IEEE,  332-336. 

A  attempt  to  combine  hardware  and  software  reliability  metrics  to  yield  a  system  estimate. 
The,  paper  documents  a  number  of  approaches  to  measuring  reliability  used  in  IBM  to  both 
evaluate  and  improve  system  products.  An  interesting  observation  is  that  in  many  cases  the 
act  of  measuring  development  alone  was  seen  to  improve  the  process  significantly. 

A.L.  Goel,  Software  Reliability  Models,  IEEE  Trans,  on  Software  Eng.  SE-11,12  (December 
1985),  1411-1423. 

A  survey  (partially  funded  by  RADC)  of  approaches  to  modeling  reliability  in  software  sys¬ 
tems.  The  paper  includes  an  a.s.sessment  of  the  limitations  and  the  assumption.s  of  several 
models  and  proposes  an  tipproach  for  fitting  a  model  to  a  problem.  The  examples  use  failure 
data  from  a  C2  system  developed  at  RADC.  A  classic  paper. 

D.I.  Heimann,  N.  Mittal,  and  K.S.  Trivedi,  Dependability  Modeling  for  Computer  Systems, 
1991  Proc.  Animal  Reliability  and  Maintainability  Symp.  (January  1991),  120-128. 

An  up-to-date  survey  of  measures  and  models  for  hardware  dependability.  The  paper 
describes  the  three  basic  measures  of  dependability  (availability,  reliability,  and  task  comple¬ 
tion)  and  di.scusses  how  dependability  analysis  makes  use  of  them.  A  discussion  of  parameter 
estimation  for  the  models  is  also  included. 

S.J.  Keene,  Cost-Effective  Software  Quality,  Proc.  Ann.  Reliability  and  Maintainability  Symp. 
(January  1991),  IEEE,  433-437. 

Outlines,  the  use  of  metrics  in  software  development  at  IBM.  Three  methods  of  software  reli¬ 
ability  estimation  -  precedent  code,  fault  content  modeling,  and  time  domain  modeling  --  are 
described.  Other  techniques  for  improving  quality  both  in  .software  and  in  the  development 
process  are  discussed. 

H.D.  Mills  and  P.B.  Dy.son,  Using  Metrics  to  Quantify  Development,  IEEE  Software  (March 
1990),  15-16. 

The  guest  editors’  introduction  to  a  special  issue  on  software  metrics.  The  whole  problem  of 
predicting  software  reliability  is  still  controversial,  partly  because  it  is  still,  as  the  authors 
describe  it,  an  "adolescent  activity."  Reliability  measurement  in  hardware  is  well  defined,  but 
is  much  less  so  in  software.  The  suggestion  from  the  authors  is  to  define  a  framework  for 
measurement  on  a  project-specific  basis  that  allows  you  to  get  results  in  a  form  that  can  be 
used  to  monitor  the  development  process  and  thus  detect  or  predict  problems. 
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J.i).  Musa,  The  Measurement  and  Management  of  Software  Reliability,  Proc.  of  the  IEEE 
68,9  (September  1980)^.  1131-1143. 

A  survey  and  history  of  software  reliability  modeling  from  1967-1980.  Although  older  and 
not  as  compiete  as  Musa’s  1987  text  (referenced  below),  this  paper  compares  software  and 
hardw^e  reliability  models  and  explains  how  the  software  models  can  be  used  to  predict  the 
behavior  of  an  operational  system  based  on  infoiTnation  from  its  development  and  the  code. 
A  good'summary  of  the  early  development  of  models. 

J.D.  Musa,  A.  lannino,  and  K,  Okumoto,  Software  Reliability:  Measurement,  Prediction, 
Application,  McGraw-Mill ,( 1987). 

The  single  most  useful  reference  for  reliability  modeling  of  software,  both  for  its  application 
Md  the  underlying  theory.  The  emphasis  of  this  text  is  on  the  use  of  reliability  measures  for 
planning  and  evaluation  development,  but  it  may  be  that  some  of  the  measures  presented  here 
and  elsewhere  can  be  used  for  detection  and  adaptation  in  operational  systems  as  well. 

J.D.  Musa,  Quality  Time:  Faults,  Failures,  and  a  Metric  Revolution,  IEEE  Software  (March 
1989),  85+. 

A  short  editorial  on  the  shift  of  quality  measurements  from  the  point  of  view  of  the  developer 
(to  whom  faults  are  of  utmost  concern)  to  that  of  the  user  (who  is  worried  about  failures). 
"Quality"  is  a  word  that  gets  bandied  about  by  lots  of  people,  most  of  whom  really  don’t 
know  what  they  mean  by  it.  From  the  u.ser’s  point  of  view,  quality  means  freedom  from 
failure,  and  Musa  argues  that  systems  should  be  evaluated  in  terms  of  failures  per  hours  of 
operation,  rather  tlian  in  faults  per  lines  of  code.  He  aUso  suggests  that  an  operational  profile 
of  a  system  (a  list  of  the  system  functions  and  the  proportion  of  time  spent  executing  in  each) 
is  a  useful  tool  for  debugging  a  system  as  well  as  evaluating  it.  In  particular,  it  can  be  used 
to  reduce  the  failure  rate  in  systems  where  usage  of  functions  is  nonuniform  (i.e.,  make  sure 
that  the  functions  you  use  most  work  best). 

Modeling  tcclmitiucs 

J.  Arlat,  K.  Kanoiin,  and  J.-C.  Laprie,  Dejjendability  Modeling  and  Evaluation  of  Software 
Fault-Tolerant  Systems,  IEEE  Trans,  on  Computers  39,4  (April  1990),  504-513. 

Models  that  describe  reliability  and  safety  in  software-fault  tolerant  architectures,  including 
recovery  blocks  and  N-version  programming.  The  studies  summarized  here  consider  both 
independent  and  reliited  faults  and  methods  for  combining  the  approaches  to  handle  multiple 
faults.  The  results  of  the  modeling  indicate  that  related  faults  between  the  variants  in  mul¬ 
tiversion  architectures  have  a  significant  impact  on  safety,  while  the  nesting  of  recovery -based 
architectures  shows  less  improvement  in  reliability  than  other  approaches.  Another  approach 
analyzed  was  the  tli.^t-'iirding  of  a  variant  that  frequently  disagrees  with  the  others  in  a  mul- 
tiversion  aichltociure-,  the  study  indicated  that  this  reduction  in  functionality  is  always 
beneficial  with  respect  to  safety,  and  to  reliability  in  the  case  where  independent  faults  dom¬ 
inate. 

W.K.  Ehrlich,  S.K.  Lee,  and  R.H.  Molisani,  Applying  Reliability  Measurement:  A  Case  Study, 
IEEE  Software  (March  1990),  56-64. 
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An  example. of  using  failure  data  to  validate  a  reliability  model  based  on  histoncal  data.  The 
testbed  was  the  AT&T  telecommunication  testing  system  RMS-Dl. 

J.K.  Muppala,  S.P.  Woolet,  and  K.S.  Trivedi,  Real-Time  Systems  Performance  in  the  Presence 
of  Failures,  Computer  24,5  (May  1991),  37-47. 

Presents  a  unified  model  for  evaluating  performance,  reliability/availability,  and  deadline  vio¬ 
lation  based  on  stochastic  reward  nets  and  Markov  chains.  The  model  gives  a  response-time 
distribution  for  a  queuing  network.  Throughputs  and  response  time  are  modeled  as  reward 
rates.  Examples  include  predicting  the  size  of  the  Markov  chains  (useful  since  the  response 
times  decrease  with  the  number  of  processors  involved  in  a  computation). 

Y.  Nakagawa  and  S.  Hanata,  An  Error  Complexity  Model  for  Software  Reliability  Measure¬ 
ment,  Proc.  Util  lilt.  Conf.  on  Software  Eng.  (May  1989),  IEEE  Comp.  Soc.  Press,  230-236. 

A  time-dependent  model  for  systems  with  complex  errors  developed  by  Nippon  Telephone 
and  Telegraph.  The  model  predicts  the  number  of  remaining  errors  in  software  based  on  the 
number  of  detected  errors  and  the  ratio  of  complex  to  simple  errors.  The  model  has  a  better 
fit  than  other  models  on  the  data  set  presented,  although  more  validation  is  needed  before  it 
can  be  used. 

J.S.  Ostroff  and  W.M.  Wonham,  Modeling,  Specifying,  and  Verifying  Real-Time  Embedded 
Computing  Systems,  Proc.  Real-Time  Sys.  Symp.  (December  1987),  IEEE  Comp.  Soc.  Press, 
124-132. 

Proposes  a  framework  for  modeling  and  verifying  hard  real-time  systems  using  extended-state 
machines.  A  specification  language  is  described  that  was  developed  for  use  in  this  framework 
and  provides  an  abstract  operational  and  axiomatic  semantics  for  programming  constructs. 

C.V.  Ramamooriliy  and  F.B.  Bastani,  Software  Reliability  -  Status  and  Perspectives,  IEEE 
Trans,  on  Software  Eng.  SE-8,4  (July  1982),  354-371. 

A  review  of  models  based  on  the  residual  error  size  predicted  and  the  testing  process  used. 
The  paper  also  describes  methods  for  estimating  program  reliability  and  the  adequacy  of  test 
cases  based  on  equivalence  classes  in  the  input  domain.  The  method  is  expensive  to  use,  but 
is  useful  for  ultrarcliable  systems  in  which  failure  data  is  difficult  to  collect. 

R.D.  Schlichting,  A  Technique  for  Estimating  Performance  of  Fault-Tolerant  Programs,  IEEE 
Trans,  on  Software  Eng.  SE-1 1,  6  (June  1985),  555-563. 

Presents  models  for  estimating  the  performance  of  fault-tolerant  programs  executing  on  fail- 
stop  processors.  Tlic  models  use  discrete-time  Markov  chains  and  z-transforms  to  derive  a 
probability  distribution  for  time  to  completion  based  on  numbers  of  failures.  The  results  are 
useful  for  predicting  a  program’s  ability  to  meet  real-time  deadlines  in  the  presence  of  failures 
in  the  system. 


Y.-B.  Shieh,  D.  Ghosal,  and  S.K.  Tripathi,  Modeling  of  Fault-Tolerant  Techniques  in  Distri¬ 
buted  Systems,  Proc.  19tli  hit.  Symp.  on  Fault-Tolerant  Computing  (June  1989),  IEEE  Comp. 
Soc.  Press,  167-174. 
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An,  evMuatibn  of  arbitrary  versus  planned  checkpointing  in  both  centralized  and  distributed 
systems.  The  checkpointing  protocol  is  modeled  using  Petri  nets.  The  results  for  the  distri¬ 
buted  sphenie  show  that  as  the  synchronization  interval  decreases,  arbitrary  checkpointing  is 
faster  than  planned  (coordinated)  checkpointing  due  to  the  overhead  of  the  processes  taking 
their  checkpoints  within  a  given  interval.  In  addition  to  the  Petri  net  modeling,  this  paper  is 
interesting  as  an  example  of  adaptive  specification,  since  it  suggests  that  planned  checkpoint¬ 
ing  is.  most  effective  as  long  as  the  synchronization  interval  remains  above  a  specified  thres¬ 
hold. 


R.M.  Smith,  K.S.  Trivedi,  and  A.V.  Ramesh,  Performability  Analysis:  Measures,  an  Algo¬ 
rithm,  and  a  Ca.se  Study,  IEEE  Trans,  on  Computers  37,4  (April  1988),  406-417. 

Models  and  algorithms  for  determining  performability  (a  combined  measure  of  performance 
arid  reliability)  in  heterogenous  distributed  systems,  both  with  and  without  repair.  The  sys¬ 
tems  are  modeled  as  Markov  chains  with  reward  rates  associated  with  each  state.  The  results 
indicate  that  distributions  of  cumulative  performance  measures  over  finite  intervals  reveal 
behaviors  not  indicated  by  either  steady-state  models  or  the  expected  values  alone,  making 
such  a  measure  an  attractive  possibility  for  an  objective  function. 

M.  Takahashi  and  Y.  Kamayachi,  An  Empirical  Study  of  a  Model  for  Program  Error  Predic¬ 
tion,  Proc.  Sth  lilt.  Conf.  on  Software  Eng.  (August  1985),  IEEE  Comp.  Soc.  Press,  330-336. 

Presents  a  model  to  estimate  the  number  of  errors  in  a  program  prior  to  testing,  using  environ¬ 
mental  and  development  factors  identified  through  regression  analysis.  The  results  show  that 
the  frequency  of  specilication  change,  the  skill  of  the  developer,  and  the  volume  of  the  design 
documentation  contribute  significantly  to  the  number  of  errors  in  software  and  can  be  used  to 
give  a  more  accurate  prediction  of  the  reliability  of  software  than  size  measures  alone. 

Automated  tools 

R.  Geist  and  K.  Trivedi,  Reliability  Estimation  of  Fault-Tolerant  Systems:  Tools  and  Tech¬ 
niques,  Computer  Th,!  (July  1990),  52-61. 

A  review  of  tools  iind  techniques  for  reliability  estimation.  Among  the  tools  profiled  is 
CARE,  which  is  described  in  more  detail  in  Chapter  9  of  the  Pradhan  text  (Volume  11)  refer¬ 
enced  in  the  overview  section. 


A.  Goyal  et  al..  The  System  Availability  Estimator,  Proc.  16th  Int.  Symp.  on  Fault-Tolerant 
Computing  (July  1986),  IEEE  Comp.  Soc.  Press,  84-89. 

Description  of  the  SAVE  package,  which  is  used  to  construct  probabilistic  moaels  of  availa¬ 
bility  and  reliability.  The  system,  intended  to  be  used  during  system  design  and  configuration, 
allows  parameters  to  be  defined  separately  for  each  component  in  the  system.  The  parameters 
may  include  number  of  spares,  the  spares  failure  rate,  the  module  that  both  operation  and 
repair  depend  upon,  the  component  failure  rale  (actual  or  probability),  and  a  list  of  affected 
components. 

J.D.  Musa,  Tools  for  Measuring  Software  Reliability,  IEEE  Spectrum  26,2  (February  1989), 
39-42. 
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Sortie  commeh IS  on  programs  that  exist  for  computing  software  reliability  using  Musa’s  basic 
and  loMritHmic  Poisson  models,  as  described  ifi  his  text.  In  general,  the  models  are  useful  for 
;prOject  management  (predicting  release  dates,  etc.)  but  are  not  sufficiently  precise  for  ultrareli- 
able;  systenis.  An  interesting  sidebar  is  the  description  of  an  approach  to  "reliability  engineer¬ 
ing"  that  uses  reliability  models  to  help  identify  the  problem  to  be  solved  and  the  inform/rtion 
that  should  be  collected  throughout  the  system’s  life  to  verify  its  reliability. 


5.  ADDITIONAL  SOURCES  OF  INFORMATION 

This  bibliography  is  not  intended  to  be  exhaustive.  Rather,  it  contains  references  that  are  par¬ 
ticularly  relevant  to  the  issues  discussed  in  the  preceding  report.  More  information  on 
dependable  distributed  computing  can  be  found  from  the  sources  listed  below, 

Annual  conferences  with  a  primary  focus  on  dependability 

International  Symposium  on  Fault-Tolerant  Cornputing 
IEEE  Symposium  on  Reliable  Distributed  Systems 

(formerly  Symposium  on  Reliability  in  Distributed  Software  and 
Databttse  Systems) 

Reliability  and  Maintainability  Syrnposium 

Pacific  Rim  International  Symposium  on  Fault-Tolerant  Systems  (Japan) 

International  Symposium  on  Fault-Tolerant  System  Design  (Bulgaria) 

Other  conferences  with  occasional  papers  on  dependability 

ACM  Symposium  on  Principles  of  Di.stributed  Computing 
Real-Time  Systems  Syifiposium 

International  Conference  on  Distributed  Computing  Systems 
International  Conference  on  Software  Engineering 


Journals  where  articles  on  dependability  often  appear 


IEEE  Transactions  on  Reliability 

IEEE  Transactions  on  Computers 

IEEE  Transactions  on  Software  Engineering 

IEEE  Transactions  on  Parallel  and  Distributed  Systems 

Communications  of  the  ACM 

Computer 

(special  issues  on  fault-tolerant  computing  were  published  in 
July  1990  and  August  1984;  other  papers  appear  occasionally) 


It  is  interesting  to  note  that  the  August  1991  issue  of  Byte  contains  several  articles  relating  to 
dependability,  particidaiiy  mirrored  disk  systems  for  reliably  maintaining  data.  This 
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movement  into.  the.  inicrocoiriputer  arena,  as  well  as  into  the  "popular"  literature,  could  indi- 
catesa-  wider. audicnce.and,  increased  interest.in  dependability  issues. 
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ANALYSIS  OF  THE  ELECTROMIGRATION-INDUCED  FAILURE 
IN  THE  VLSI  INTERCONNECTION  COMPONENTS  AND 
THE  MULTISECTION  INTERCONNECTIONS 


Ashok  K.  Gael.  Assistant  Professor,  ard 
Matthew  M.  Leipmtz,  Graduate  Student 

D^aitmeot  of  Ekctiical  Engiiieeiing 
Mktigan  Tedmolo^cal  Univetsity 
Hoaghtoo,  MI  49931 

ABSTRACT 

We  have  carried  out  a  first-order  analysis  of  the  electromigration 
induced  figure  effects  in  the  various  VLSI  interconnection  components 
incluritng  the  multisection  interconnections  using  the  series  model  for 
failure  mechanism.  Hie  Components  include  a  straight  interconnection 
segmentj  an  interconnection  bend,  an  interconnection  step,  an  intercon¬ 
nection  plug,  an  interconnection  via,  an  interconnection  overflow,  a  hor¬ 
izontal  multisection  interconnection,  a  vertical  multisection  interconnec¬ 
tion,  a  mixedl  multisection  interconnection,  and  a  power/ground  bus. 
First,  by  considering  the  effect  of  average  flux  density  on  the  grain- 
boundaiy  migration,  we  have  reduced  each  interconnection  component 
in  to  a  series  or  series-parallel  combination  of  straight  segments.  Then, 
for  each  of  the  components,  we  have  investigated  the  dependence  of  the 
median-time-to-failure  and  the  Log-normal  standard  deviation  of  the 
corresponding  failure  distribution  on  the  various  component  parameters. 
The  results  can  be  utilized  to  choose  optimum  values  of  the  component 
parameters  for  minimum  probability  of  interconnection  failure  due  to 
electromigration. 
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1.  INTRODUCTION 


Contihuous  ^vances  in  the  field  of  VLSI  are  resulting  in  smaller  integrated  cir¬ 
cuit  chips  having  millions  of  interconnections  that  integrate  the  components  on  the 
chip.  Interconnection  failure  caused  by  electromigration  is  one  of  the  major  factors 
responsible  for  lowering  the  effective  lifetime  of  the  ch^  [1-3].  'Iherefore,  it  is  very 
important  to  understand  the  dependence  of  the  electromigration  induced  interconnec¬ 
tion  failure  on  the  various  interconnection  parameters.  In  fact,  in  the  past,  several  stu¬ 
dies  have  been  dedicated  to  this  effort  [4-8]. 

In  general,  an  interconnection  line  on  an  IC  Chip  consists  of  several  components 
such  as  straight  segments,  bends,  steps,  plugs  and  vias.  In  addition,  there  are  power 
and  ground  buses  serving  several  logic  gates  on  the  chip.  For  submicron  width  inter¬ 
connection  lines,  there  can  be  sections  along  the  line  length  suffering  from  material 
overflows.  In  this  report,  for  the  first  time,  we  also  introduce  a  multisection  intercon¬ 
nection  which  can  be  designed  in  three  possible  configurations:  horizontal,  vertical  and 
mixed.  A  multisection  intersection  differs  from  a  generally  employed  interconnection 
in  that  a  driver  and  its  load  are  connected  by  more  than  one  interconnection  line  thus 
providing  more  than  one  path  for  the  currentA^oltage  signal  to  flow. 

In  this  report,  we  present  an  analysis  of  the  electromigration-induced  failure  in 
each  of  the  interconnection  components  listed  above.  First,  using  the  effects  of  the 
average  flux  density  on  the  grain  boundary  migration  in  the  interconnections,  we  have 
derived  expressions  for  the  effective  lengths,  widths  and  thicknesses  of  the  staight  seg¬ 
ments  equivalent  to  each  of  the  components.  Then  we  have  used  the  series  model  of 
failure  mechanisms  in  the  interconnections  [9]  to  determine  the  series  or  series-parallel 
combinations  of  straight  segments  equivalent  to  each  intercormection  component. 
Finally,  we  have  studied  the  dependence  of  the  electromigration-induced  Median- 
Time-to-Failure  (MTF)  Shd  the  sffindard  deviation  of  the  conesponding  lognormal 
failure  distribution  (a)  on  the  various  parameters  of  each  interconnection  component. 
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2.  REDUCTION  OF  COMPONENTS  INTO  STRAIGHT  SEGMENTS 

First,  we  have  analyzed  a  straight  interconnection  segment,  shown  in  Figure  1(a), 
of  len^  L,  width  W  and  thickness  T  carrying  a  current  /  at  a  given  temperature. 
Then,  by  considering  the  effects  of  the  average  flux  density  on  the  grain  boimdary 
migration  m  each  interconnection  component,  we  have  reduced  it  to  a  series-parallel 
combination  of  equivalent  straight  interconnection  .^gments.  The  average  flux  density 
in  a  component  was  determined  by  using  the  interconnection  current  /  and  the  average 
cross  sectional  area  throughout  the  component. 

The  additional  area  in  an  interconnection  bend  of  angle  0^,  shown  shaded  in  Fig¬ 
ure  1(b),  was  found  equivalent  to  a  straight  segment  of  length  Lq  and  width  Wq  given 
by  the  expressions: 


For  a  bend  angle  of  90  degrees,  these  expressions  yield  values  in  agreement  with  those 
derived  by  Frost  and  Poole  [9]. 

An  interconnection  line  of  length  L,  width  W  and  thickness  T  having  a  single 
step  of  height  H  and  angle  85,  shown  in  Figure  1(c),  is  equivalent  to  three  straight 
segments  each  of  width  W,  lengths  and  Lg^,  and  thicknesses  Tg^,  Tg^  and  Tg^ 

respectively  given  by  the  expressions: 


Lg^-L  + 


H 

tan  65 


Ts,-T 

Lg  =  T  cos  05  + 

*  sm  85 


Tg^  =  T  cos  05 
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It  T  ( 180  ^  Oj,)  (  1  -  cos  Os  ) 
is,  = - ^ - 

(  1  +  cos^  0s  -  sin  65  cos  65) 

“  2Ls^  +  T  (  1  -  cos  0s  )  " 

Two  straight  sections  of  an  interconnection  line  of  total  length  L  ,  width  W  and 
thickness  T  joined  by  a  single  plug  of  length  H  and  square  dimension  W/>,  shown  in 
Figure  1(d),  is  equivalent  to  three  straight  segments  of  lengths  Lp^  and  Lp^  widths 
Wp^,  Wp^  and  Wp^,  and  thicknesses  Tp^  and  Tp^  respectively  given  by  the  expres¬ 
sions: 

Lp^  =  L-Wp 

Tp,^T 
Lp^:^H 
Wp^  = 

Tp,  = 

Lp^^j(T  +  Wp) 

Wp^^W 

T(.W  +  Wp  ) 

^/’3= - L - 

An  interconnection  line  of  length  L,  width  W  and  thickness  T  having  a  length  Lq 
suffering  from  overflow  ( top  and  end  views  are  shown  schematically  in  Figure  1(e) ), 
is  equivalent  to  two  straight  segments  of  lengths  Lq.^  and  Lq^,  widths  Wq^  and  Wq^, 
and  thicknesses  Tq^,  Tq^  given  by  the  expressions: 
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—  ^  ~^0 


tg,^T 

Wo,  =  W 


Two  straight  sections  of  an  interconnection  line  of  total  length  L,  width  W  and 
thickness  T  joined  by  a  via  of  height  //,  width  Wy  and  angle  0^^,  shown  in  Figure 
1(f),  is  equivalent  to  four  straight  segments  each  of  width  W,  lengths  Ly^,  Ly^,  Ly^  and 
Ly^,  and  thicknesses  T’v,  .  TVj,  Tvj  and  Ty^  respectively  given  by  the  expressions: 


Lv,= 


Gy  K  T  (  1  +  COSiQy)  ) 


Tv.- 


[  1  +  cosGy  sinOy  +  cos^  0^  ] 
2  Ly^  +  T  [  1  +  COSGjr  ] 


H 

sinGi/ 


-  T  sinGv 
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Ty^^T 

A  horizontal  multisection  interconnection  of  length  L  consists  of  a  parallel  combi¬ 
nation  of  iV  interconnections  each  of  length  L,  width  W  and  tWckness  T  placed 
between  two  rectangular  pads  each  of  lengA  and  width  Wp^  =  (2N  -  1)W  on 
each  side  which  are  in  turn  connected  to  the  interconnection  driver  and  its  load.  A 
schematic  diagram  of  the  top  view  of  a  horizontal  multisection  mterconnection  is 
shown  in  Figure  1(g). 

A  vertical  multisection  interconnection  of  Length  L  consists  of  a  parallel  combi¬ 
nation  of  N  interconnections  one  of  which  is  printed  on  top  oLthe  substrate  while  the 
others  are  embedded  in  the  substrate  exactly  below  the  top  section..  Each-section  is  of 
length  L,  width  IV  and  thickness  T.  The  sections  are  connected  to  each  other  at  the 
ends  by  conducting  plugs  each  of  length  Lp.  A  , schematic  diagr^  of  the  side  view  of 
a  vertical  multisection  interconnection  is  shown  in  Figure  ^h),.  A  ntixed  multisection 
interconnection  is  fomied  by  mixing  the  horizontal  and'vp.rtici^  multisectioh  intercon¬ 
nections  i.e.  it  has  a  few  sections  printed  on  top  of  the  substrate  in  additioh  to  a  few 
sections  embedded  in  the  substrate. 

As  shown  in  Figure  l(i),  a  power  or  ground  bus  serving  Ng  gates  on  the 
integrated  circuit  chip  was  modelled  as  a  series  combination  of  N  straight  sclents 
carrying  currents  equal  to  1,21,  31, . .  Ngl  where  I  is  the  current  in  each  gate. 


3.  CALCULATION  OF  MTF  AND  LOG-NORMAL  STANDARD  DEVIATION 

First,  for  a  basic  conductor  element  of  length  lOp^n,  the  median-time-to-failure 
was  found  by  using  the  expression  [9]: 


MTF  =  1,523.0 


W  T 

n 

w  _  ^07  4- 

/xlO^ 

,  4 

10,740.74 
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where  /  is-therihtercpm^  in  itiA,  n  is  the  current  density  exponent,  is 

/the  a^yation  energy  of  the  interconnection  materid  in  eV,  is  the  temperature  in 
Kelvins^  W  is?  die  interconnection  width  in  ]m  and  T  is  the  interconnection  thickness 
ilhen,  a  first  approxunation,  the  median-time-to-failure  of  a  series  combi¬ 
nation  bfW  elements  (  )  was  found  by  using  the  expression: 

1  1  4-  1  4.  4.  1 

'MlWs  .  MTFf^ 

whereas  that  of  a  parallel  combination  of  ff  elements  ( MTFp  )  was  found  by  using  the 
expression: 

mFp  =MTFi.  +  MTF2  + . +  MTFjv 

The  lognormal  standard  deviation  (o)  of  a  basic  conductor  element  of  width  W 
(pw)  was  given  by  [9]: 


4.  THE  PROGRAMS  "EMVIC”,  "EMVIC-Z”  and  "EMGRAPH" 

To  date,  we  have  developed  three  programs  called  EMVIC,  EMVIC-2  and 
EMGRAPH.  Each  of  the  programs  is  interactive,  and  extremely  user-friendly.  EMVIC 
and  EMVIC-2  are  written  in  FORTRAN-77  while  the  graphics  program  called 
EMGRAPH  is  written  in  C. 
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ilie  pfogfam;iEMYICican  be  u^d  to  detennine  the  MTF  and  Lognormal  standard 
deviation  of  a  straight,  interconnection  segment  (sis),  interconnection  bend  (ib),  inter¬ 
connection  step  (is),,  interconnection  plug  (ip),  interconnection  via  (iv),  interconnection 
overflow  (io),  horizontal  multisection  interconnection  (hmsi),  vertical  multisection 
interconnection  (vmsi),  mixed  multisection  interconnection  (mmsi)  and  a  power/ground 
bus  (pgbus).  (The  symbols  in  parentheses  were  used  to  name  the  data  files  as 
explained  below.)  First,  the  user  uses  the  default  values  or  chooses  his/her  own  values 
for  the  several  parameters  of  any  component  listed  above.  For  a  straight  segment,  the 
parameters  include  its  length  (il),  width  (iw),  thickness  (it),  temperature  (temp),  current 
(curr),  ciurent  density  exponent  (cde)  arid  its  material’s  activation  energy  (iae).  In  addi¬ 
tion  to  these  parameters,  the  other  components  are  defined  by  the  additional  parameters 
listed  below: 

INTERCONNECTION  BEND:  Bend  Angle  (ba) 

INTERCONNECTION  STEP:  Step  Height  (sh).  Step  Angle  (sa) 

INTERCONNECTION  PLUG:  Plug  Length  (pi).  Square  Plug  Dimension  (pd).  Plug 

Material  Activation  Energy,  Lower  Level  Material 
Activation  Energy 

INTERCONNECTION  VIA:  Via  Height  (vh),  Via  Width  (vw).  Via  Angle  (va). 

Lower  Level  Material  Activation  Energy 

INTERCONNECTION  OVERFLOW: 

Overflow  Length  (ol) 

HORIZONTAL  MULTISECTION  INTERCONNECTION: 

Number  of  Horizontal  Sections  (nhs),  Source/Sink  Pad 
Lengths  (spi) 

VERTICAL  MULTISECnON  INTERCONNECTION: 

Number  of  Vertical  Sections  (nvs).  Vertical  Plug 
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Lengths  (vpl).  Plug  Material  Activation  Energy 


y- 


MCPP -I^LtlSEenON  INTERCONNECnON: 

Number  of  Horizontal  Sections  (nhs),  Number  of 
Embedded  Vertical  Sections  (nvs),  Vertical  Plug 
Lengths  (vpl),  Source/Sink  Pad  Lengths  (spl),  Plug 
Material  Activation  Energy 

POWER  OR  GROUND  BUS:  Number  of  Gates  Served  by  the  Bus  (ng),  Cunent  in 

Each  Gate  (gc) 

After  the  user  defines  the  component,  EMVIC  calculates  the  MTF  and  a  for  it  and 
displays  these  values  on  the  screen.  The  user  can  choose  to  write  the  simulation  res.  Jts 
on  an  output  file  called  EMVIC.OUT. 

The  program  EMVIC-2  incorporates  and  extends  the  program  EMVIC  in  the 
sense  that  it  allows  the  user  to  study  the  dependence  of  MTF  and  a  for  any  component 
on  its  several  parameters.  First,  the  user  can  define  the  reference  component.  Then 
he/she  can  select  a  variable  parameter  and  choose  its  lowest  and  highest  values  for  the 
dependence  analysis.  The  number  of  points  at  which  the  analysis  is  to  be  carried  out 
can  also  be  selected  by  the  user.  Then,  EMVIC-2  calculates  MTF  and  a  at  evenly  dis¬ 
tributed  points  in  the  range  of  analysis  and  writes  the  results  on  an  output  file  called 
EMVIC-2.QUT.  It  also  creates  data  files  named  COMPONENT-PARAMETER.DAT 
one  for  each  parameter  of  every  component  analyzed  by  the  user.  The  values  of  COM¬ 
PONENT  and  PARAMETER  are  the  symbols  enclosed  in  the  parentheses  above. 
These  data  files  are  written  in  a  format  required  by  the  graphics  program  EMGRAPH. 
At  this  point,  the  user  can  use  EMGRAPH  to  plot  the  results  of  any  of  the  above 
dependence  studies.  EMGRAPH  also  allows  the  user  to  change  the  appearance  of  the 
plot  in  order  to  create  custom  plots.  These  plots  can  then  be  sent  to  6  printer.  The 
source  codes  of  EMVIC-2  and  EMGRAPH  contain  nearly  6,000  and  1,000  lines, 
respectively.  The  flow  chart  of  EMVIC-2  is  shown  in  Figure  2. 
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5.  SIMULATION  RESULTS 

The  program  EMVIC-2  has  been  used  to  study  the  dependence  of  MTF  and  a  on 
the  various  pwameters  of  each  interconnection  component.  In  the  following  results, 
current  density  exponent  was  set  at  1.0.  (Due  to  length  restriction  on  this  report,  only 
a  few  results  will  be  included  here.) 

First,  for  a  strmght  interconnection  segment,  the  dependence  of  MTF  and  a  on 
the  segment  width  in  the  range  0.5-5  |imis  shown  in  Figure  3.  The  relatively  sharp 
increase  in  MTF  and  a  for  widths  less  than  nearly  2  pm  is  due  to  the  so-called  "bam¬ 
boo"  effect  [4,5].  For  an  interconnection  bend,  the  dependence  of  MTF  and  o  on  the 
bend  angle  in  the  range  10-150  degrees  is  shown  in  Figure  4.  This  figure  shows  that  a 
bend  ^gle  of  nearly  50  degrees  results  in  the  lowist  value  of  MTF.  For  an  intercon¬ 
nection  step,  the  dependence  of  MTF  and  a  on  the  step  angle  in  the  range  90.1-160 
degrees  is  shown  in  Figure  5.  This  figure  show.*!  that  MTF  decreases  rapidly  as  the 
step  angle  approaches  90  degrees.  This  is  becfr.se  of  the  gradual  thinning  of  the 
material  at  the  step.  For  a  horizontal  multisectioic;'  interconnection,  the  dependence  of 
MTF  and  a  on  the  number  of  horizontal  sections  in  the  range  1-5  is  shown  in  Figure 

6.  This  figure  shows  that  MTF  varies  nearly  as  where  n  is  the  number  of  sections. 
This  is  because  the  cunent  density  in  each  section  is  nearly  (l/«)  of  that  in  the  original 
single-section  interconnection  and  further  because  all  sections  must  fail  before  the 
interconnection  fails  completely.  Similar  dependence  on  the  number  of  vertical  sections 
was  observed  for  the  vertical  multisection  interconnection  as  shown  in  Figure  7.  How¬ 
ever,  compared  to  the  horizontal  configuration,  vertical  multisection  interconnection 
offers  the  advantage  that  it  does  not  require  any  additional  space  on  the  chip.  For  a 
1,000  \un  long  power  or  ground  bus  serving  100  identical  gates,  the  dependence  of 
MTF  and  a  on  the  current  in  each  gate  in  the  range  0.1-1  mA  is  shown  in  Figure  8. 
This  figure  shows  that  increasing  the  gate  ciurents  results  in  lower  values  of  MTF  for 
the  bus,  as  expected. 
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6.  SUMMARY  AND  CONCLUSIONS 

To  suimname,  we  have  carried  out  a  first-order  analysis  of  the  electromigration 
induced  failure  effects  in  the  several  interconnection  con^onents.  First,  each  com¬ 
ponent  has  been  reduced  into  a  series-parallel  combination  of  equivalent  straight  inter¬ 
connection  segments  and,  then,  the  series  model  of  failure  mechanism  has  been  used  to 
determine  the  MTF  and  the  corresponding  lognormal  standard  deviation  (cr)  for  each 
component.  The  algorithms  have  been  used  to  study  the  dependence  of  MTF  and  a  on 
the  various  paranieters  of  each  interconnection  component.  Though,  due  to  the  approri- 
mations  inherent  in  the  series  model,  the  analysis  presented  in  this  report  gives  approx¬ 
imate  results,  yet  these  can  be  used  to  draw  important  conclusions  regarding  the  optim¬ 
ization  of  the  various  interconnection  componettts. 
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Figure  1(g);  Schematic  diagram  of  a  Horizontal  Multisection  Interconnection 


Figure  1(h):  Schematic  diagio^  of  a  Vertical  Multisection  Interconnection 


Figure  l(i):  Schematic  diagram  of  the  Power  and  Ground  Buses  Serving  Ng 

Gates  on  the  Chip 
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(Continued  on  Next  Page) 
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A 


Write  Simulation  Results  in 
EMVIC2.0UT  and  in  COMP-VP.DAT^ 


(*NOTE:  The  file  COMP-VP.DAT  stands  for  Component 
Name^Variable  Parameter  Name.DAT) 

Fig.  2.'  Flow  Chart  of  EMVIC-2 
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Figure  3:  MTF  and  Deviation  va.  Width  for  818 
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Figure  4:  MTF  and  Deviation  va.  Angle  for  a  Bend 
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interconnection  Step  Angle  (Degrees) 

Figure  6:  MTF  and  Deviation  vs.  Angie  for  a  Step 
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Number  of  Horizontal  Sectione 
Figure  6:  MTF  and  Sigma  vs.  No.  of  Sections  for  HMSI 
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Numl^  of  Vertical  Sections 


Figure  7:  MTF  and  Sigma  ye.  No.  of  Sections  for  VMSI 
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Figure  8:  MTF  and  Sigma  vt.  Gate  Current  for  a  F/G  Bus 
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1  INTRODUCTION 


My  original  plan  was  to  work  this  summer  on  optical  wave  guides  and 
an  optical  Tunneling  AND  gate.  The  tunneling  devices  have  to  be  fabricated 
completely  in  vacuum.  A  robot  mask  and  substrate  changer  for  a  vacuum 
systern  at  Syracuse  University  Micro-Electronics  Laboratory  (SUMEL)  that 
is  to  be  used  for  the  fabrication  of  the  tunneling  devices  was  to  be  fabri¬ 
cated  In  the  Rome  Laboratory  (RL)  machine  shop. 

The  robot  mask  changer  was  designed  and  it  is  now  being  fabricated  in 
the  RL  machine  shop.  Preliminary  test  have  been  performed  Indicating  that 
a  light  sen.<>itive  tunneling  diode  can  be  fabricated.  Indeed,  the 
continuation  of  this  work  using  the  robot  mask  changer  is  the  subject  of 
my  follow  on  proposal. 

While  I  was  waiting  for  the  robot  rhask  changer  I  worked  on  two  other 
projects.  We  tested  a  liquid  crystal  Spatial  Light  Modulator  (SLM).  and  a 
binary  lens  for  replicating  light  beams.  This  work  Is  described  below. 

2  PHOTONIC  TUNNELING  AND  GATE 

The  Photonic  Tunneling  AND  Gate  consist  of  two  photosensitive  tunnel¬ 
ing  diodes  connected  electrically  connected  in  series.  The  individual 
diodes  only  conduct  when  illuminated.  Since  the  conduction  process  in  the 
diodes  is  due  to  the  photo  assisted  tunneling  effect  the  response  of  these 
devices  should  be  exceedingly  fast. 

The  device  consists  of  a  bottom  aluminum  film.  This  film  is  covered 
with  a  50  A  thick  aluminum  oxide  film.  This  oxide  film  form  the  tunnel 

barrier  of  the  device.  The  edges  of  the  bottom  film  are  covered  with  a 
protective  SiOg  layer  to  prevent  shorting  of  the  top  and  bottom  metal  lay¬ 
ers.  The  oxide  layer  is  covered  with  a  semitransparent  aluminum  film.  A 
thicker  aluminum  is  deposited  over  the  semitransparent  aluminum  film 
between  the  bottom  metal  layer  and  the  contact  for  the  top  metal.  Both 
metal  contacts  are  covered  with  a  copper  film.  All  these  films  have  to  be 
patterned.  The  patterning  is  achieved  by  depositing  the  various  films 
through  silicon  wafer  masks.  These  masks  are  fabricated  by  etching  a 
crude  pattern  of  the  particular  mask  from  the  bottom  of  the  wafer  pene¬ 
trating  to  within  a  few  pm's  of  the  top  face  of  the  wafer  is.  The  top  sur¬ 
face  is  etched  in  the  desired  pattern.  This  allows  very  accurate  pattern¬ 
ing.  5-2 


The  various  silicon  wafer  masks  and  glass  substrates  can  be  manipu- 
laited  by  the  mask  and  substrate  changer  robot. 

Since  the  oxide  layer  is  only  50  A  thick  it  is  necessary  for  the  bottom 
aluminum  layer  to  be  exceedingly  smooth.  Indeed,  aluminum  films  de¬ 
posited  on  glass  in  the  SUM  EL  are  smooth  on  a  scale  of  less  than  about  a 
few  hundred  A.  The  aluminum  films  where  analyzed  with  a  Scanning  Elec¬ 
tron  Microscope  (SEM)  at  the  RL.  The  SEM  photograph  showed  that  the 
aluminum  fils  were  smooth  to  the  limit  of  the  resolution  of  the  SEM  which 
is  of  the  order  of  a  few  hundred  A. 

3  LIQUID  CRYSTAL  LIGHT  MODULATOR 

3a  INTRODUCTION 

A  Spatial  Light  Modulator  (SLM)  is  a  device  designed  for  computer 
controlled  modulation  of  laser  light  on  an  individual  element,  or  pixel, 
basis.  SLM  fabrication  is  conventionally  done  in  two-dimensional  arrays  to 
provide  sufficient  area  to  cover  typical  laser  beam  dimensions  (on  the 
order  of  one  mm2),  jhe  primary  component  of  the  SLM  consists  of  an  array 
that  is  comprised  of  two  glass  plates  separated  by  a  thin  film  of  liquid 
crystal  material.  A  reflective  chromium  mask  creates  100  liquid  crystal 
elements  (pixels)  in  a  10x10  array  pattern.  Transparent  electrical  con¬ 
tacts  are  deposited  on  both  the  left  and  right  side  plates  of  the  liquid 
crystal  cell.  One  set  of  contacts  Is  patterned  and  other  is  a  common  elec¬ 
trode.  A  polarizer  is  provide  on  the  output  side  of  the  device.  By  applying 
a  voltage  across  the  contacts,  an  electric  field  is  produced  across  the 
molecules  in  the  liquid  crystal  which  orient  them  in  one  of  two  preferred 
states  that  differ  in  alignment  by  45  degrees.  The  state  in  which  the 
molecules  are  aligned  can  be  changed  by  switching  the  polarity  of  the 
electrical  contacts.  The  array  is  illuminated  with  polarized  coherent  light. 
The  output  polarizer  is  arranged  to  block  all  light  when  there  is  no  bias 
applied  across  the  device.  Applying  a  voltage  across  any  cell  of  the  SLM 
changes  the  alignment  of  the  liquid  crystal  molecule.s  which  in  turn  ro¬ 
tates  the  polarization  so  that  light  is  transmitted  through  the  output  po¬ 
larizer. 
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The  contact  switching  voltage  Is  generated  by  a  computer.  If  polarized 
light  is  incident  on  the  array  when  the  molecules  in  a  liquid  crystal  ele¬ 
ment  are  aligned,  the  element  functions  as  a  half-wave  plate.  A  half¬ 
wave  plate  rotates  the  polarization  of  the  incident  light  through  an  angle 
twice  that  formed  by  the  incoming  polarization  vector  and  the  molecular 
state  of  orientation  (optic  axis).  When  an  element  is  illuminated  with 
light  polarized  parallel  to  the  optic  axis,  no  rotation  in  the  polarization  of 
the  light  occurs  in  the  element's  throughput.  In  this  case,  the  analyzer 
minimizes  transmission  through  the  system,  since  its  transmission  axis 
is  perpendicular  to  both  the  input  polarization  and  optic  axis,  see  Fig.  1 . 
Applying  the  opposite  voltage  to  the  contacts  results  in  rotating  the  ele¬ 
ment's  optic  axis  45°  with  respect  to  its  original  orientation.  The  light 


Figure  1.  Operating  Principles  of  the  the  SLM. 

transmitted  through  the  array  therefore  undergoes  a  90°  rotation  in  polar¬ 
ization.  Consequently,  its  polarization  is  parallel  to  the  trans-mission 
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axis  of  the  analyzer  and  results  in  maximum  transmission  (see  Figure  1, 
OPEN  state).  The  pattern  of  light  transmitted  through  the  array  can  thus 
be  regulated  by  controlling  the  voltages  sent  to  each  element.  Further 
information  on  the  basic  principles  of  the  SLM  can  be  found  on  page  2  of 
the  Operator's  Manual. 

The  interest  in  investigating  a  spatial  light  modulator  was  to  deter¬ 
mine  the  feasibility  of  its  use  in  controlling  the  throughput  of  pixels 
generated  by  a  binary  phase  grating  element  that  splits  an  input  laser 
beam  into  25  spatially  separated  laser  beams.  For  this  purpose,  a  custom 
made  Ferroelectric  Liquid  Crystal  (FLC)  spatial  light  modulator  manufac¬ 
tured  by  Displaytech,  Inc.  of  Boulder,  Colorado,  was  tested  to  determine 
its  contrast  ratios  for  multiple  element  rnd  single  element  areas.  This 
report  describes  the  software  and  commands  used  to  control  the  device, 
the  experimental  methods  implemented  to  study  the  device,  and  the  exper¬ 
imental  results. 

3b  OPERATING  THE  SLM 

The  device  was  driven  with  the  Displaytech  DDR128  Driver  and  con¬ 
trolled  with  its  accompanying  Pattern  Generator  board  and  software.  The 
driver  provides  the  voltages  required  to  run  the  FLC  array  whereas  the 
board  communicates  to  the  driver  the  patterns  created  with  the  software. 
Once  the  board  and  software  were  installed  in  a  Zenith  150  computer,  any 
pattern  or  series  of  patterns  (sequences)  could  be  activated  on  the  FLC 
array  by  the  user.  This  section  is  best  understood  if  the  user  is  operating 
the  computer  software  as  it  is  read. 

Hardware  connections  and  software  installation  should  be  performed 
per  the  manufacturer's  Operator's  Manual.  The  manufacturer  warns  that 
leaving  the  array,  or  any  element,  in  a  particular  state  for  more  than  5 
seconds  could  possibly  damage  its  operation.  Discussion  of  other  potential 
operating  dangers  can  be  found  in  the  Operator's  Manual  and  the  DDR256 
Driver  for  FLC  Direct-Drive  Arrays  User's  Manual,  Version  1.0. 

3c  PRECAUTIONS 

The  manufacturer  warns  that  leaving  the  array,  or  any  element,  in  a 
particular  state  for  more  than  5  seconds  could  possibly  damage  its  opera- 
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tion.  Therefore  no  single  pattern  was  executed  with  a  step  time  longer 
than  4005  msec,  though  sequences  can  run  repeatedly  without  time  re¬ 
strictions.  In  addition,  patterns  were  followed  by  their  inverses  for  the 
same  durations  to  insure  that  no  element  was  operating  at  hazardous 
charge  build-up  levels.  The  sum  of  the  step  times  for  the  two  patterns 
should  not  exceed  5  seconds  as  charge  build-up  harmful  to  the  array  could 
occur  in  the  elements  that  do  not  switch  in  the  pattern  change. 

Further,  caution  should  be  exercised  when  applying  optical  power  to  the 
elements  of  the  array.  Oisplaytech,  Inc.  specifies  that  the  elements 
should  not  be  exposed  to  more  than  500  mW/cm^  (i.e.  5  mW/cm2)  from  a 
continuous  wave  laser.  For  this  reason,  the  optical  power  density  did  not 
exceed  4.5  mW/cm^when  investigating  a  single  element.  Special  care 
should,  be  taken  when  using  focusing  optics  to  illuminate  individual  pixels. 

3d  EXPERIMENTAL  METHOD 

The  tests  on  the  SLM  were  performed  at  both  830  and  1320  nm  diode 
laser  wavelengths.  The  SLM  is  intended  to  operate  at  a  wave  length  of 
1320  nrT5  The  device  was  characterized  by  DIsplaytech,  Inc.  at  830  nm. 
Therefore  the  performance  characteristics  of  the  SLM  at  830  nm  were 
studied  for  comparison  with  those  listed  by  the  manufacturer. 

As  mentioned  earlier,  the  FLC  array  and  output  polarizer  transmits 
light  when  the  polarization  of  the  input  beam  is  parallel  to  an  alignment 
of  the  liquid  crystal  molecules.  This  array  was  designed  so  that  it  would 
require  incident  light  that  was  vertically  polarized.  Thus,  the  first  tests 
done  at  each  wavelength  were  to  verify  the  poiarization  of  the  input  laser 
beams  and  to  determine  the  contrast  ratio  of  each  polarizer  used.  A  Gala 
laser  served  as  the  830  nm  light  source.  Using  a  broadband  Melles  Griot 
polarizer  and  a  Newport  835  Optical  Power  Meter,  the  Gala  laser  beam  was 
confirmed  to  be  vertically  polarized  as  the  polarizer  yielded  a  contrast 
ratio  of  approximately  1000:1.  At  the  1320  nm  wavelength,  a  "Lightwave” 
laser  was  used  It,  too  was  checked  for  its  polarization  with  the  1320  nm 
narrow-band  polarizer  supplied  as  part  of  the  SLM.  This  laser  also  was 
verified  to  emit,  vertically  polarized  light  with  the  contrast  ratio  of  the 
polarizer  being  on  the  order  of  105:1. 
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An  inspection  of  the  SLM  and  its  driver  followed  the  confirmation  of 
the  polarization  states  of  each  laser.  A  Newport  modular  beam-expanding 
filter  consisting  of  a  15x  microscope  objective,  a  25  pm  diameter  pin 
hole,  and  a  collimating  lens  was  inserted  into  the  system  to  produce  a 
uniformly  collimated  beam  expanded  to  illuminate  the  entire  array.  To  im¬ 
age  the  throughput  light,  a  lens  and  a  screen  of  facsimile  thermal  paper 
were  inserted  after  the  SLM,  as  shown  in  Figure  2  (not  drawn  to  scale). 

A  lead-selenide  CCD  camera  with  a  spectral  response  from  visible 
wavelengths  to  1.6  pm  was  used  to  acquire  the-  image  information.  The 
generated  analog  video  signal  was  then  fed  directly  to  a  TV  monitor  or  to 
a  Macintosh™  ll  computer.  A  frame  grabber  board  installed  in  the  computer 
digitized  the  image  and  the  software  package  Image  2.0  allowed  for  the 
generation  of  three-dimensional  intensity  plots. 


Figure  2.  Experimental  setup  for  qualitative  observation  of  the 
SLM's  response  to  bias. 

The  experiment  revealed  that  not  all  of  the  elements  were  capable  of 
switching  states  as  designed.  A  sequence  in  which  all  100  pixels  were  in 
the  "on"  state  (i.e.  allowing  for  the  passage  of  light)  for  4  seconds  and 
then  in  the  "off"  stale  for  4  seconds,  areas  of  the  array  unable  to  respond 
to  the  sequencing  instructions  could  be  seen.  In  this  device  only  pixels  in 
half  of  the  array  were  "on"  while  only  random  elements  on  the  other  half 
of  the  array  were  "on".  Despite  programming  the  entire  array  to  switch  on 
and  off,  these  "dead"  pixels  failed  to  respond.  Tracing  the  faulty  pixels  to 
their  electrode  pinouts  unveiled  that  all  of  the  non-operating  pixels  were 
controlled  through  the  same  channel  on  one  of  the  connectors.  Further  in- 
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ves^a!k}n  of  tSie  channel  reveslaf  iha!  port  C  cf  the  driver  failed  to  out- 
put  a  positive  voft^e  during  the  sequence.  The  pins  in  this  pert  main¬ 
tained  a  constant  voit^e  of  -14.6  V,  whereas  the  other  ports  had  pins  al- 
temating  ftpm  -14.6  V  to  14.6  V.  Various  pins  in  these  ports  did  not  di¬ 
rectly  swih;h  between  the  two  voltages;  an  intermediate  voltage  was 
^.'netimes  read  before  a  shift  was  completed. 

Once  the  functioning  status  of  the  FLC  array  was  determined,  the  setup 
v/as  modified  to  test  muftipixel  areas  of  the  array  for  their  contrast  ra- 
ho.s.  Tne  contrast  ratio  is  the  ratio  of  the  I^ht  power  transmitted  when 
the  SLM  aflcws  the  pass^e  of  light  (i.e.  the  ”on”  state)  to  that  transmit¬ 
ted  when  ir:  the  ”off  state  (power  is  at  a  minimum).  Ideally,  an  input  po¬ 
larizer  and  an  analyzer  witfi  mutually  perpendicular  transmission  axes 
could  be  rotated  synchronously  to  minimize  the  off  state  power  through 
the  array.  However,  since  the  polarization  of  the  input  beam  in  tiiis  setup 
was  supplied  by  toe  laser  itself,  it  was  necessary  to  rotate  toe  FLC  array 
and  leave  toe  laser  and  analyzer  fixed.  A  mount  was  constructed  for  toe 
array  housing  such  that  toe  device  could  be  rotated  about  toe  beam  axis. 

Because  half  of  toe  array  was  not  operating  correctly,  a  mask  was  used 
to  block  toe  light  from  tiie  defective  part  of  toe  array  so  that  test 
efforts  would  be  concentrated  on  toe  array's  working  elements.  The  opti¬ 
cal  power  meter,  connected  to  a  Zenith  248  computer  through  an  IEEE-488 
interface,  replaced  the  screen  and  toe  camera  so  that  power  measure¬ 
ments  could  be  acquired  and  saved  on  the  hard  disk. 

To  perform  a  similar  study  on  single  elements  of  toe  array,  the  modular 
beam-expanding  filter  was  removec  from  the  setup.  The  approximate  exit 
beam  dimension  of  the  Lightwave  laser  was  1.5  mm  x  0.7  mm  whereas  the 
pixels  had  dimensions  of  0.85  mm  x  0.85  mm.  Ev  m  though  each  laser  was 
listed  as  having  a  beam  diameter  smaller  than  the  width  of  an  element,  a 
lens  was  inserted  to  assure  that  the  diverging  laser  beam  would  be  fo¬ 
cused  down  to  illuminate  only  the  area  of  a  single  pixel. 

3e  EXPERIMENTAL  RESULTS 

A  dependence  of  the  contrast  ratio  on  the  orientation  of  the  FLC  array 
to  the  input  polarization  was  evident  from  the  data.  Table  2  lists  read 


filter  MuSc  FLC  array 

Rg.  3-  Experimental  setup  for  ptwer  measurements  of  multiple 
element  areas  on  the  SLM. 
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Figure  4.  Experimental  arrangement  for  measuring  the 
transmission  of  a  single  pixel. 
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Table  2  :  Test  data  for  multiple  element  areas  of  the  SLM 
ing  the  830  nm  source 

US- 

ings  taken  to  determine  tfie  contrast  ratio  for  a  multiple  pixel  area  of  the 
SLM  using  the  830  nm  source.  The  data  shows  how  a  change  in  the 
contrast  ratio  was  induced  by  rotating  the  array.  The  contrast  ratio  for 
the  area  fell  off  rapidly  from  the  maximum  as  the  array  was  rotated 
through  a  few  degrees  of  arc. 

Further  study  of  this  dependence  with  respect  to  individual  elements 
revealed  that  each  element  had  its  own  orientation  for  which  the  contrast 
ratio  was  maximized.  These  orientetions  varied  within  a  few  degrees  of 
the  orientation,  q^iaxt  f^at  produced  the  maximum  contrast  ratio  for  the 
multiple  pixel  area  discussed  above.  Table  3  displays  the  readings  ob¬ 
tained  from  four  different  elements  (designated  as  pixels  a,  b,  c,  and  d)  of 
the  SLM  also  using  830  nm  light  The  contrast  ratio  for  each  or  these  ele¬ 
ments  was  greater  than  that  found  for  the  multiple  pixel  area,  but  was  in 
agreement  with  Displaytech,  Inc.’s  measured  contrast  ratio  of  203:1  for  a 
single  element  The  weaker  performance  of  the  larger  area  directly 
stemmed  from  the  requirement  that  each  element  needed  a  different  ori¬ 
entation  to  maximize  its  contrast  ratio. 

The  samples  taken  while  using  the  830  nm  source  also  suggested  that 
the  two  states  of  the  optic  axis  for  the  multiple  element  area  did  not 
differ  by  45  degrees.  If  the  two  orientations  were  at  a  45  degree 


Rotation  of  SLM 

Power  (in  iiW) 

(mount  reading) 

Maximum 

Minimum 

Contrast  ratio 

Pixel  a 

30 

905 

8.1 

112:1 

5.50 

479 

3.9 

123:1 

5.50 

453 

3.8 

119:1 

Pixel  b. 

3.50 

759 

4.5 

169:1 

4.50 

900 

4.4 

205:1 

4,50 

937 

3.8 

247:1 

Pixel  c. 

3.50 

1201 

0.4 

222:1 

3.50 

1105 

5.1 

217:1 

Pixel  d 

4.50 

786 

4.0 

197:1 

4.50 

945 

4.4 

215:1 

Table  3:  Test  data  for  single  elements  of  the  SLM  at  830  nm 
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Rollon  of  SLM 
(mount  reading) 

P^mr  fui 

Ccntrasl  ratio 

Pixel  a  3® 

4$4 

21.9 

23:1 

Pixel  f.  9.5° 

1003 

3.2 

313:1 

9.5® 

1079 

3.3 

327:1 

Pixel  g.  9.5® 

1370 

2.3 

596:1 

Pixel  h.  10® 

1354 

0.54 

2571:1 

Table  4:  Test  data  for  single  elements  of 

the  SLM 

at  1320  nm 

Pixel  index 

Power  0n  jtVV) 

(rotation  fixed  at  9.5®1 

Maximum 

Contrast  ratio 

1 

1360 

3 

450:1 

2 

1450 

6 

240:1 

3 

1330 

1 

13.30:1 

4 

1380 

2 

6'?a:1 

5 

1430 

5.5 

2eCe:1 

6 

1390 

4 

3ai?:1 

7 

1410 

1.5 

940:1 

8 

1400 

1 

Table  5:  Test  data  for  random  elements  of  the  SLM  at  1320  am 
(rotation  fixed) 


separation,  similar  power  readings  would  be  seen  at  angles  fO  and  -B 
away  from  dmax*  The  maximum  power  would  decrease  as  tfse  anay  was 
rotated  away  from  qmax>  whereas  the  minimum  power  would  iiicrease. 
These  tendencies  were  not  present  in  the  measurements  The  maximum 
power  column  in  Table  2  reflects  the  non-symmetry  in  power  readings 
that  suggests  the  two  states  of  the  multiple  element  iver©  not  45> 
degrees  apart.  The  output  power  was  seen  to  increase  when  array 
rotated  away  from  q^ax  in  one  direction,  but  to  decrease  wnen  dono  s«  in 
the  opposite  direction.  The  separation  between  the  tw^o  states  jnay 
remain  constant  as  the  area  is  rotated.  Further  investigation  co-  id 
determine  the  separation  between  the  two  states  of  a  multiple  eiemjnt 
area  and  its  dependence  on  the  orientation  of  the  array  to  the  analyzer. 

The  measurements  completed  using  the  1320  nrv>  source  resulted  in 
significantly  higher  contrast  ratios  than  those  taken  using  830  nm  light. 


Table  4  lists  measurements  of  tiie  power  outputs  for  four  random  ele¬ 
ments  to  determine  the  contrast  ratio  that  could  be  expected  for  an  indi¬ 
vidual  element  illuminated  with  1320  nm  light  The  first  reading  may  not 
have  been  rotated  to  yield  the  best  contrast  ratio,  whereas  the  others 
were  taken  at  the  orientation  giving  the  lowest  ”off  state  power  trans¬ 
mission.  For  comparison,  a  contrast  ratio  for  a  multiple  element  area  was 
measured  to  be  107:1. 

tTeadings  taken  of  eight  random  elements  with  the  rotation  of  the  array 
fixed  to  one  orientation  also  demonstrated  how  each  element  had  to  be 
reoriented  to  maximize  its  contrast  ratio;  see  Table  5.  These  measure¬ 
ments  further  suggested  tiiat  the  SLM  functioned  better  when  iliuminated 
with  1320  nm  light  as  the  contrast  ratios  were  greater  than  those  made 
using  the  830  nm  source. 

It  is  important  to  note  that  the  magnitude  of  the  contrast  ratio  is  lim¬ 
ited  by  the  quality  of  the  polarizers  used  in  the  polarizer-analyzer  sys¬ 
tem.  if  the  system  has  a  maximum  to  minimum  transmission  ratio  of 
250:1,  then  the  SLM  cannot  have  a  contrast  ratio  greater  than  250:1. 
Although  the  SLM  appeared  to  function  better  at  1320  nm  than  at  830  nm, 
the  analyzer  at  1320  nm  extinguished  light  with  greater  efficiency  than 
the  830  nm  polarizer.  The  difference  in  performance  of  these  polarizers  at 
their  respective  wavelengths  may  have  accounted  for  the  significant  im¬ 
provement  in  the  contrast  ratio  of  the  device  when  studied  using  1320  nm 
light. 

Error  may  have  been  introduced  as  a  result  of  the  large  background 
noise  present  during  power  measurements.  The  background  noise  varied 
from  4  pW  to  14  pW  v/ith  a  fluctuation  of  approximately  0.5  pW.  This 
noise  was  1  to  4  times  greater  than  the  output  power  to  be  measured  when 
the  SLM  was  operating  in  the  "off"  state.  The  background  was  subtracted 
from  the  readings  using  one  of  two  processes.  When  measurements  were 
acqui'^ed  through  the  IEEE-488  interface,  the  background  was  calculated  by 
averaging  50  samples  of  the  power  meter  readout  with  the  source  beam 
blocked  before  a  series  of  measurements  was  conducted.  This  average 
background  was  subtracted  from  each  measurement  before  it  was  stored 
for  future  data  processing.  For  measurements  obtained  by  a  visual  reading 
of  the  power  meter's  LCD  display,  a  null  button  was  activated  while  the 
source  beam  was  blocked.  The  null  button  allowed  for  the  storage  of  the 
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reading  present  on  the  display  when  the  butter,  was  depressed.  The  power 
meter  would  then  subtract  the  stored  value  from  any  future  readings  be- 
fore  *hey  were  displayed. 

The  overall  assessment  of  the  SLM  is  that  it  is  not  optimal  for  the  in¬ 
tended  application  where  it  would  be  used  to  independently  switch  25 
spatially  separated  laser  beams.  The  switching  requirement  would  easily 
be  satisfied  by  the  manufacturer's  design  of  the  controller,  driver,  and 
software,  but  the  need  to  rotate  the  array  to  maximize  the  throughput  for 
each  element  disqualifies  the  array  as  a  candidate  for  the  modulation  of 
the  low  power  laser  beams  that  will  be  present  In  the  system.  The  varia¬ 
tion  in  con^ast  ratio  among  individual  pixels  at  a  fixed  angle  (see  Table 
5}  indicates  that  the  throughput  of  some  elements  would  not  be  maximized 
without  rotation  of  the  array.  The  SLM  is  generally  used  with  a  laser 
beam  of  sufficie.nt  power  so  that  the  reduced  throughput  would  not  ad¬ 
versely  affect  the  dynamic  range  of  a  detection  system.  In  this  applica¬ 
tion,  the  decreased  throughput  that  will  result  in  the  pixels  that  are  not 
oriented  at  their  maximum  would  cause  detection  of  the  low  power  laser 
beams  to  be  close  to  the  noise  level  of  a  photodetector. 

The  splitting  of  the  laser  beam  into  symmetric,  low  power  beams  is  a 
basic  design  criteria  and  cannot  be  changed.  Therefore,  another  scheme 
must  be  employed  to  switch  the  25  spatially  separated  laser  beams. 

4  BINARY  LENS 

V/e  tested  a  binary  lens.  This  device  replicates  j  '"ingle  light  beam  in  a 
25  by  25  beam  array.  The  binary  lense  consists  of  a  g.ass  plate  in  which  a 
two  dimensional  grading  like  pattern  was  etched.  The  pattern  of  this  par¬ 
ticular  device  has  only  two  levels,  the  top  surface  of  the  glass  plate  and 
the  etched  level.  The  etched  area  is  about  6  by  6  mm.  The  lens  was  de¬ 
signed  to  operate  at  a  wavelength  of  1.3  pm.  We  tested  the  device  both 
with  a  uniform  collimated  beam  from  a  spatially  filtered  lasci  beam  and 
alight  beam  directly  from  a  laser.  The  device  had  an  uniformity  from  beam 
to  beam  of  60%.  The  beam  diverges  when  it  emerges  from  the  lense.  It  has 
to  be  recoilimated  with  a  lense. 

The  devise  v/as,  first,  tested  by  projecting  the  light  output  of  the  lense 
on  a  screen.  A  lead-selenide  CCD  camera  with  a  spectral  response  from 
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visible  wavelengths  to  1.6  fim  was  used  to  acquire  the  image  information 
from  the  screen.  The  analog  video  signal  generated  was  then  fed  directly 
to  a  TV  monitor  or  to  a  Macintosh™  I!  computer.  A  frame  grabber  board 
installed  in  the  computer  digitized  the  image  and  the  software  package 
image  2.0  allowed  for  the  generation  of  three-dimensional  intensity  plots. 

The  device  was  also  tested  by  measuring  the  light  power  in  each  output 
beam  with  an  optica!  power  meter.  The  uniformity  of  the  device  was  de¬ 
termined  with  the  power  meter. 
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aiMaciERizmoN  of  RaraR  clcttper  as  a>j  siep 

CHaELES  T.  TGEENER  and  JAY  K.  LEE,  Ph.I). 

fi^TRaCT 

It  has  been  proposed  that  radar  clutter  can  be  modeled  as  a  spherically 
invariant  random  process  or  SIRP.  SIRPs  seem  well  suited  to  this  role  since 
by  variation  of  certain  paraiKters  the  Weibull/  K-  or  Rayleigh  distributed 

t 

clutter  envelopes  are  obtained.  These  distributions  are  significant  since 
they  fit  veil  with  eaqjeriroental  radar  clutter  data  under  different  circum¬ 
stances.  In  this  report,  a  radar  clutter  model  based  on  rough  surface 
scattering  is  developed  tosiow  that  SIRP  characterization  can  be  based  on 
electromagnetic  principles.  Small  perturbation  analysis  of  a  t^ro-scale 
randanly  rough  surface  is  chosen  since  the  form  for  the  badcscattered  field 
has  the  proper  form  for  an  SIRP,  under  certain  conditions.  This  is  an  im¬ 
portant  step  in  being  able  to  predict  the  proper  statistical  distribution 
of  radar  clutter  based  on  surface  geonetry  and  electromagnetic  properties. 
INTRODUCTION 

The  characterization  of  radar  clutter  has  been  an  important  study  for  radar 
engineers,  fron  the  inception  of  radar,  to  the  present.  Due  to  the  randan 
nature  of  background  terrain,  statistical  characterization  seems  appropriate 
from  both  theoretical  and  experimental  considerations.  Gaussian  models 
have  been  cormonly  used,  but  they  become  inadequate  in  many  cases  of  inter¬ 
est.  Other  characterizations  have  been  proposed  on  the  basis  of  empirical 
studies,  axaitples  being  Rician  [1],  Weibull  [2],  log-normal  [3],  and  K- 
type  [4,  b].  More  recently  it  has  been  proposed  that  radar  clutter  be 
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modeled  as  a  conrolex  random  process  [6],  more  specifically  as  a  spherically 
invariant  random  process  (SIRP)  [7-9].  Briefly,  an  SIRP  is  such  that  given 
a  real  or  conplex  random  process 

every  sanple  tak^  from  it,  that  is 

is  spherically  invariant  random  vector  (SIRV)  Tdth  one  and  the  same  charac¬ 
teristic  cvnnulative  distribution  function  (CCDF).  An  SIRV  can  be  thought 
of  as  being  obtained  by  a  product  of  a  Gaussian  random  vector  times  an  in¬ 
dependent  non-negative  random  scale  factor  s  ^ath  seme  CDF,  idiich  is  the 
CCDF  of  SIRV  under  consideration.  The  usefulness  of  the  SIRP  characteri¬ 
zation  comes  by  varying  parameters  of  s  to  obtain  different  clutter  statisics. 

It  can  be  shown  [6]  that  for  certain  parameters  the  Rayleigh,  K-,  or  Weibull 
clutter  envelopes  result.  This  unifying  concept  is  an  important  step  in 
fxirther  understanding  clutter. 

The  purpose  of  this  report  is  to  develop  a  mathematical  model  for  clutter 
from  electromagnetic  theory,  suitable  as  a  basis  for  an  SIRP  characterization. 
Specifically,  the  form  of  received  signal  (from  clutter)  should  be  of  the 
form  V(t)  =  G(t)  •  s  where  G(t)  is  a  Gaussian  random  process,  s  is  a  random 
variable,  and  V(t)  is  the  received  signal.  Without  regard  to  type  of  receiver, 
the  signal  V(t)  sought  is  the  backscattered  field  E^(t),  As  a  further  speciali¬ 
zation,  concentration  will  be  on  a  single  sample  (or  instance  of  time)  corres¬ 
ponding  to  the  returns  from  a  single  radar  illumination  area  or  "footprint". 
DISCUSSION  0}  APPROPRIATE  MODELS 

In  the  attempt  to  derive  useful,  accurate  mathematical  models  for  radar  clutter, 
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tTO  approacihes  are  ccxiimonly  used.  Tb£  first  views  the  ^ilvmiinated  area  or 
target  bad^ound  as  a  collection  of  point  scatterers  [10].  These  scatterers 
are  assumed  randanly  distributed  over  a  surface,  each  having  an  associated 
anplitude  and  phase.  In  the  most  general  case,  the  amplitude  and  phase 
distributions  are  randan  variables.  Classically,  by  making  various  assump¬ 
tions  about  these  distributions,  different  clutter  characteristics  are  de¬ 
rived,  e.g.  a  Rayleigh  distributed  envelope  results  when  the  phase  is  assumed 
equal  and  constant.  This  phenonenological  approach  is  useful,  but  not  really 
derived  fron  first  principles. 

A  second  approach,  and  the*  one  used  in  this  report,  is  treating  radar  clutter 
as  scattering  fron  a  rough,  randomly  distributed  surface.  A  rough  surface 
can  be  described  in  a  systan  of  Cartesian  coordinates  as 

21=  Kc^,y) 

^^here  ^  is  a  random  variable  depending  on  x  and  y.  The  mean  level  of  the 
surface  is  the  plane 

z  =  o 

Any  point  on  the  surface  can  be  described  by  a  position  vector  from  the 
origin 

^  ^  AS.  V 

r  =  Crv,/; 

A  randan  surface  can  be  described  by  the  statistical  distribution  of  its 
deviation  fron  a  mean  level.  This,  however,  does  not  completely  describe 
the  surface;  it  says  nothing  about  how  close  or  far  apart  the  hills  and  de¬ 
pressions  are.  A  second  function,  the  correlation  function,  or  its  normal¬ 
ized  version,  the  correlation  coefficient  describes  this  aspect  of  the  sur¬ 
face.  The  correlation  coefficient  C(t)  gives  a  measure  of  dependence  or 
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correlation  of  two  points  ^(x,  /  y, )  and^Cx^,  y^)  on  the  surface.  If  the 
separation  of  the  points 

nr  ^  r/2-X')^ 

is  very  large,  (not  on  the  same  hill  or  valley)  then  the  points  are  in¬ 
dependent.  Conversely,  points  near  one  another  will  be  correlated;  ^dien 
*^=0,  they  are  the  same  point,  or 
|inr\  C  (n:)  =  1 

The  dis^ance  T  in  ^diich  C('t)  drops  to  e*'  is  called  the  correlation  length. 

A  commonly  used  distribution  is  the  normal  distribution  with  zero  mean 

where  o-  is  the  rms  value  of  the  deviation  of  ^  from  z=0.  A  commonly  used 
correlation  coefficient  is 

C(r)-  ^-*-9 

Specification  of  c-  and  T  makes  the  rough  surface  model  appro^cimate  a  wide 
variety  of  rough  surfaces  met  in  practice. 

Scattering  of  waves  (acoustic  or  electromagnetic)  from  a  rough  surface  is 
a  complex  problem  studied  by  many  people  in  this  century  and  yet  it  re¬ 
mains  a  popular  topic  for  research  papers.  Many  methods  of  analysis  have 
been  used  to  study  scattering  from  a  rough  surface,  but  usually  each  method 
assumes  a  priori  scane  condition  or  scale  limitation  to  malce  the  problem 
tractable. 

T\ro  important  examples  of  the  methods  are  the  Kirchhoff  approximation  (or 
tangent  plane  approximation)  and  the  small  perturbation  method  (SPM).  In 
the  former  an  assumption  is  made  that  the  incident  ^/avelsngth  is  much 
smaller  than  the  variations  of  the  surface.  Under  this  assumption,  an 
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incident  wave  locally  "sees"  a  planar  surface.  Reflection  (and  trans¬ 
mission)  at  a  planar  interface  is  a  classic  problem  in  electromagnetics, 
so  by  integrating  over  the  surface,  a  solution  can  be  found.  Mathematic¬ 
ally,  the  validity  of  the  solution  is  restricted  by  the  condition,  derived 
by  Brekhovskikh  [11],  that  the  radius  of  curvatvure  of  the  surface  is  much 
greater  than  the  incident  wavelength  or 

^TTy'^  los  A  (as  reported  by  Beckmann) 

where  rj^  is  the  minimum  radius  of  curvature  in  any  direction  and  is  the 
angle  between  the  incident  propagation  vector  k  and  the  local  normal  n.  A 
second  condition  [10],  which  intuitively  needs  to  be  satisfied,  given  the 
condition  on  r^.  is;  the  correlation  length  must  exceed  the  ^ravelength 

-r>  A 

(Beckmann's  formulation  [see  10]  is  classic  and  widely  used.) 

Sometimes  a  condition  is  used  for  stationary-phase  or  geometric  optics,  which 
includes  that  of  physical  optics,  [12,  13]  namely 

This  has  the  advantage  that  the  rms  height  deviation  jris  related  directly 
to  \.(k  Because  the  rms  height  o-  and  correlation  length  T  are  large 

compared  to  A  f  this  type  of  surface  is  called  large  scale  rough  surface. 

At  the  other  extreme  is  the  SPM.  In  this  method  it  is  assumed  that  the  rms 
height  is  small  compared  to  the  wavelength 
4  <105^  0  CT-*"  1 

where  B  is  the  angle  of  incidence. 

Rice  solved  this  problem  using  a  two-dimensional  Fourier  series  expansion 
of  the  surface  [14].  The  coefficients  of  the  Fourier  components  are  deter- 
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froE  bcsaSsSasy  cassnifciocs  ca  tSie  sarfacs.  A  tiariafcioia  cm  tisis  mefeSacia 
was  de^ijlopea  Smrscws  [15]*  aSieie  Sw  also  asssanassCT  t&e  sms  sloce  is  smaSl, 


>-'f- 
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the  surface  aqsstiora  cam  ce  sqaacsd  in  a  sayicr  series  abcoifc  i£s  mean,- 
Osii^  tbe  slcce  a^JEsstioa,  isj^aer  orcer  tanns  cam  is  rasglsrted-  'i6em, 

in  a  EJsnner  similar  to  3ics,  tfe  ssct  bcsHcary  ccrsiSitiess  cm  tis  certurnec 
surface  are  rewritteE^  oomditicms  whici  aspiy  to  iSis  ussrturtJ^  piaisr 
surfac®  a’iid  a  solution  is  found.  Ss  say  be  giss^d  sucii  surfacss  are  calied 
small  srale. 

Whai  modelii^  surfacss  oocurri*^  in  nature,  the  a5gasgtti:si  of  eacltsiwely 
large  or  -«?tCTn  scale  is  restrictive  and  unrealistic  whsi  ocEsared  to  cKidem 
radar  wavelei^h.  If#  for  exairole#  an  ocean  vers  to  be  aodeled  as  a  large 
scale  surface#  the  radar  vaveleigth  would  need  to  be  subcdlliEeter 
due  to  the  Presence  of  fine#  wind  produced  ripples.  Cmivar^ly,  if  it 
vrere  modeled  as  a  small  scale  surface#  the  correspcsiding  radar  vaveleigth 
would  be  on  the  order  of  tens  of  neters,  because  of  the  large  sea  swells 
or  gravity  waves.  Such  is  the  case  with  most  surfaces  of  interest  with 
regard  to  radar  clutter. 

To  accanmodate  analysis  of  more  realistic  surfaces,  a  two-scale  or  ccmroosite 
model  has  been  proposed  and  studied  by  several  authors  [12,  16-21].  A 
conposite  surface  is  simply  viewed  as  a  super^sition  of  a  snail  scale 
variation  on  a  large  scale  variation. 
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"sts  divisiOTB  ffletaaga  seals  asaa  sssll  seals  coobes  aafanally  ewes  tSia 
rsasz  ■SESweSssgta  is  ssscifisS-  5»Siils  »Jiis  n-sdiei  g’as  seme  liinitatioBS  of  its 
c)3GB;  it  is  fer  HKJES  ■ssrsafcils  tiiaa  eit&er  tSie  lac^  or  srngii  scale  hogeI 
aicane. 


'Efeso  of  t!ie  sore  istSESstii^  amd  simple  amprcsi&as  to  tifith  cammajsite 

sssrfaces  are  tiis  tilted  plane  metJiod  and  a  more  ^jeralized  SE!I-  Is  tiie 
tilted  plama  East&od,  tSis  la^^  scale  ■variatim  is  ;=^i!w>=d  to  pre^st  locally 
fiat  titled  plaae  areas  ^dth  snpsriisxssed  saall  seals  variatitm.  'Sis  oaly 
difference  ia  tbs  asslysis  ietaesi  tMs  and  aw  otier  sell  scale  rsofel  is 
tSst  the  awle  of  incidasce  eith  regard  to  tha  normal  les  effectively 


a)  ^avs  incidsit  uprei  snail  b)  save  incidait  iroon  coeposita 

scale  surface  surface 


The  SHI  applied  to  a  conoosita  surface  is  also  siniilar  to  that  ar^liad 
to  an  exlusively  snail  scale  surface-  The  exact  boundary  conditions  on 
the  surface  are  transferred  to  a  systan  of  boundary  conditions  on  the  large 
scale  surface  instead  of  the  surface  z=0.  Effectively,  the  variations 
of  scattered  field  due  to  the  snail  scale  roughness  are  vieued  as  being 
caused  by  ecpiivalent  surface  current  densities  which  create  the  same  result¬ 
ant  field  as  the  small  scale  roughness.  The  total  scattered  field  is  the 
sum  of  zeroth  order  cemrponent,  due  only  to  the  small  scale  variation.  Brovm 
[12]  and  Bass  et  al.  [19,  20]  have  both  applied  this  method  to  a  composite 
scale  smrface. 
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Beggase  of  tS»  sslatively  sisroie  foasa  of  fesie  scattered  field  cco^leS  ^rith 
tne  fast  tSsfc  cross  golarisatice  teros  zre  Sstaiit  ia  fefee  fiaai  fcis,  fcSie 
ScM  for  coEECsita  sarfacs  is  asefsil  as  a  Ssasis  to  cfiaracterize  radar  clutter 
as  aa  SIS?-  S.  ssort  saasary  of  tliis  retSioa  ai^sears  ia  foilosisg 
CH  S!g>TT.  psgIt^^^:?4T7(g^  j»aL¥STg 

“Sie  first  case  treated  is  tlEt  of  a  ssall  scale  surface  cnly.  Hiis  is  dcire 
to  illustrate  t!e  Esain  points  of  tte  ai^ysis  vitlKJUt  becsKdi^  unduly 
boggad  daxn  in  cJEUjersccs  ^tails.  FoUoifing  this  siErnary,  it  is  shosn 
h057  ssmll  perturbation  analysis  is  aOTiisd  to  a  coEposite  surface  as  a  logi¬ 
cal  I^Xt  StCT- 

In  the  scatterii^  analysis  of  a  ssall  scale  surface,  the  surface  (denoted  X) 
is  r^resented  by  normal  deviatioi  C  (t  )  from  a  smoth  surface  z=0  daiotea 


Ihe  normal  to  the  surface  51  is  denoted  N,  while  the  normal  to  the  surface 
z=0  is  denoted  n.  It  is  assumed  that  the  rough  surface  divides  a  vacuum 
and  medium  xrith  relative  permittivity  £  .  Also  it  is  assumed  that  the 
permeability is  constant  across  the  interface.  Boundary  conditions  on  the 
surface  ^  have  the  form 

Cn«(e;-?4]^--0_,  [^•a.-eE^)]^=  O  (1, 

Ch.-h>]j  =  o 

The  subscript  1  refers  to  the  fields  in  the  first  medium,  ’/’lila  the  sub¬ 
script  2  refers  to  the  fields  in  the  second  medium.  Assuming  the  normal 


6-8 


^viatiosis  C  si 

S,are  saell 


slopes  Y-^Cof  fehe  sarfsceX'relati^  to 


tSie  surface 


{k(rf^<l  , 


- 

X  ^<i 


s^iere^is  a  surface  gradieit  asdir'  is  the  rets  deviatitsi  of  ^ ,  zbs  bound¬ 
ary  coijditiOTi  (1)  may  be  transferred  to  the  oaan  surface  S  tlu:o=^h  a  Taylor 
series  espansicEi.  Higter  order  tenss  can  be  neglected  the  ccE^itions  (2). 
Tfe  assuee  the  fields  are  of  the  fom  of  a  inean  field  ^and  a  fluctuatiiHi 


field  e- 


H.“  )  H  =  9^'*'  ^ 


In  this  appraxination,  the  mean  field  coincides  v/ith  the  field  reflected 
frem  S/  'While  the  fluctuation  field  is  small  relative  to  the  mean.  The 
boundary  conditions  are  thus  ret-nritten 

L" « (i, ' ^  H) 

nv(&,-e-L)  =  iY X  (£,-^2)]  ~  ^  f  £,-£2)]  i  1 


n  X  (?,  -KO  =  ix  ^  t  r  (5) 


Equivalent  current  densities  j  and  j  electric  and  magnetic,  respectively, 

s  ^  ^ 


are  defined  as 
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Ty  expressing  earns  of  £,(thrtJU^  bamaary  cra^tions  4)  in 

terns  of  C.  {.Mtsxsa^  IfeSBll’s  egs)  and  are  stressed  in  terns  of 
tnXBS  - 

je=  Ikc  jzf. 

^TT  " 

(7) 

>  =  4  Lh.|7(n.£0C] 

*1518  general  field  e, ^2)  ^  point  above  (belcc?)  the  dielectric  is  expressed 

as  a  sum  of  plane  t-aves,  i-e. 

J  (8) 


^vhere  K.=  r  +  ZZ  ;  k-|k!/Sj.  }  ^ 

m 

Each  cesnponent  plane  ’.■rave  must  satisfy  (from  Majarell's  ess) 

h.f-p  =  e,  )  [h  ^  j® 

\  *1 


^diere 


'T  I  f  -L  K'Y 

I  je,„  0  Jr 


a=n'jS  I  b=  y  6-i  + 


Fran  these  expressions,  "e,  (K),  0j  (V<)  are  found  to  be 


) 


ill) 


’J-)  * ja)]  } 

Z.^e.  .J-  U-^.e.)  . 

'Ehe  field  rai  tl^  surface  2=0  is  irrittai  usi*^  tl^  Fresi^  fornsilas  for  an 
incident  field 


'Where 


-?•  ICzZ.) 


(12) 


^diere  is  the  distance  frcra  the  source  to  the  c^ter  of  the  scattering 


surface  and  p  is  the  polarization  vector.  Hius  it  can  be  derived 


c>o+  6<5< 


where 


Qo  -  -K1 ‘Of  lio  -  J  &  -coi^'^jP 


-A 

Substituting  eq  (13)  fore,  into  eqs  (7)  for  and  ;  eqs  (7)  into  eqs  (10) 
for  jg^and3in  ;  and  eqs  (10)  into  eq  (8),  the  field  is  found  at  any  point 
in  space.  The  integral  may  be  evaluated  by  saddle-point  integration  in 
view  of  the  fact  that  the  observation  point  is  located  in  the  Fraunhofer 
zone.  For  the  backscatter  case  where  k;  =~ICt/ 


TT 


■*  t>  ^ 

^  (14) 


zeroth  order  field  E  is  simply  the 


surface 

For  the  case  of  a  small  scal^only  the 
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and  can  be 

field  reflected  froa  a  plane  interface^^ound  usii^  Eresi^  coefficirats- 
lajsn  tbe  SFM  is  airalied  to  a  cocEXJsite  surface/  the  Eethodolociy  is  similar/ 
hut  more  ctzsersaae.  For  the  tao  scale  ca^,?  se  have  a  surface/  denoted  hy 
^  /  reprasaited  nonnal  deviations  ^(^5 )  from  a  aaoother  surface  S. 


Ihe  position  of  points  on  2  is  related  to  points  on  S  by  r£  =  r^  -r  N^Cr^ ) , 
uhere  Tj  is  the  normal  to  the  mean  surface  S.  The  normal  to  Z  uill  be  de¬ 
noted  n-  For  a  medium  of  relative  permittirdty  <£.  below  the  surface  and 
vacuum  above/  the  boundary  conditions  are 

,  £>.  fE-e  E^)]^  ,  O 

(IS) 

-  O 

I 

Again,  assuming  the  surface  deviations  from  S  are  small,  with  gentle  slopes 
such  that 


(16) 


ivhere  0^  =  Var  relative  to  S,  thtn  an  ezcjansion  of  the  fields  in  terms 
of  powers  of  ^  is  valid,  keeping  only  the  first  two  terms 

£  (fj  )  ~  E'r.)  •*  ^rr,)(N 


07) 


H  ('4')  -  f>  'I  ,  V --  V,  C('ii) 
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Hie  scatterecl  field  is  again  assuned  to  be  of  the  form  c  =£-5-  e,  H  =Jf-rh 

O 

and  the  analysis  proceeds  in  a  similar  fashion  to  the  analysis  i'or  a  small 

scale  surface  rally.  In  both  cases  the  final  form  for  the  field  scattered 

from  the  small  scale  roughness  is  the  same,  the  only  differaice  being 

ar^i^rs  in  the  eiqxinential  e  *  for  the  conroosite  surface  case.  I 

the  limiting  case  is  talcen,  lim  (perfectly  conducting  surface),  then  the 

e->«o 


field  becomes 


TtRo 


(18) 


a  simpler  form  of  (14)  to  work  mth.  Ihe  zeroth  order  field  is  found  in  a 
much  more  direct  fashion  to  be  [12] 


for  the  perfectly  conducting  surface  case. 

DEVELOPMEM'  OF  THE  SIRV  FORM 

In  general  the  SPM  applied  to  ccxnposite  surface  of  large  and  small  scale 
variations^ and  ^respectively,  predicts  a  scattered  field 


kT  -kz  Ca] 


clxd/  0^) 


i.e.  a  sum  of  a  zeroth  order  field  and  a  first  order  fluctuation  field. 
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Brown  derives  these  components  (for  a  perfectly  conducting  surface)  as 


a  ' 


^  [7^ 


PP  = 


rrr 


V2. 


c, 


^7-1 

ey 


_ X  ^ 

-  b<  ^Xp  (-] 


Copp  ^x,o(f--jfcr)  I 

^rrr  CoS  0  J  J  - 

A 

'Where  E  is  the  amplitude  of  incident  field 
kl  specifies  the  propagation  vector  of  the  incident  field 

=  XX  +  ^  +  z  is  the  position  vector  from  the  origin  to  any  point 

on  the  large  scale  surface 
♦ 

n  is  the  unit  normal  to  the  large  scale  surface 
k  =  j  ktj  is  the  vravenun'ber 

P  ,  p'  denote  incident  and  backscattared  polarizations^ respectively 


- 1  r, 


C  -  ^ 

on  \^  - 

dy 


) 


k' 

s  unit  vector  in  the  direction  of  incident  propagation 
r  is  tne  distance  from  the  transmitter-receiver  to  the  origin 

'frp' 


A  represents  the  beam  illuminated  area  (radar  footprint) 
©•  is  the  angle  of  incidence  =  ^  ^  7  ^ 
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7 


^  ^  -o 

The  incident  field  Ei  =  pt  xl  is  assumed  to  be  a  plane  irave  with  time 


dependence  suppressed. 

As  previously  mentioned  the  term  Ea  represents  cne  scattered  field  from  the 
large  scale  surface  variation  only.  It  is  well  kno^m  that  the  £©  term  is 
strongest  in  the  specular  direction  and  falls  off  rapidly  outside  a  cone 
centered  axially  on  the  specular  ^ravevector.  I'Jhen  this  general  result  is 
specialized  to  radar  backscatter  the  implication  is  that  the  zeroth  order 
field  has  its  strongest  effect  on  the  return  signal  at  or  near  normal  in¬ 
cidence  0=  0°.  It  will  in  fact  dominate  the  radar  return  if  the  radar  re¬ 
ceiver  is  located  within  the  cone.  It  can  also  be  shovm  [10>chap  5]  that 
the  spread  of  the  cone  is  proportional  to  rms  height  cone  width  (in  degrees) 
oc  •  In  the  limiting  case  as  (r->>  O  the  surface  becomes  a  plane  and 

reflection  occurs  in  the  specular  direction  only  (Snell's  Law),  which  is 
intuitively  satisfying. 

Now  the  desired  mathematical  form  sought  is  a  product  of  a  Gaussian  random 
variable  and  sane  scalar.  To  simplify  the  form  of  the  backscattered  field 
we  assume  that  the  slope  m  of  the  large  scale  variation  is  less  than  0.2. 
Ulaby,  Moore  and  Fung  [13,  chap  12]  have  calculated  backscattering  coeffici¬ 
ents  due  to  large  scale  rougliness  using  the  tangent  plane  method  which  shows 
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that  large  scale  backscatter  becomes  very  snail  for  angles  of  incidence 
greater  than  30°  if  the  large  scale  rms  slope  is  less  than  0.2.  Under 

jk. 

these  conditions/  the  first  order  field  e  will  dcaninate  the  radar  back¬ 
scatter.  By  grouping  terms  of  "e  which  depend  only  on 

scattered  field.  has  the  form  (subject  to  the  aforementioned  limitation) 


The  basic  area  of  integration  A  for  a  single  look  of  the  radar  (sometimes 
called  the  radar  footprint)  can  be  broken  up  into  a  summation  over  smaller 
subareas  or  patches.  Since  and  vary  much  more  "slowly"  over  the 

footprint/  a  logical  way  to  break  up  the  integral  is  by  patches  over  which 
and  VXsi  remain  fairly  constant.  Doing  so  yields 


or  more  compactly 


where 


is  now  of  the  form 
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Now  consider  a  general  tern  of  the  series 


.If  the  size  of  Ap>>J^  where  Jl  is  the  correlation  length  of  ,  then  the 
integral 


can  also  be  expressed  as  a  sum  over  smaller  areas 


^  ^  I  ^ 


1 

Yr\ 


.y 


Each  ^  random  variable  depending  on^^.  By  choosing  the  subpatch  area 

as  a  square  of  dimensions  ^  ,  then  each  subpatch  will  be  independ¬ 

ent  of  the  others.  Neighboring  subpatches  would  be  somewhat  correlated, 
but  here  we  will  assume  that  the  effect  is  small  and  neglect  it.  The  mean 

of  X 


-  f  (  e.  -  0 


since  by  definition  =  0. 

The  variance  of  Xm 


)<d  X  dy  d  U  d  Y 
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-  I  y  ^  <^Y^Ayd[Aiiv 

by  stationarity  of  ^  ^  i.e. ,  if  is  spatially  homogeneous. 

This  fourfold  integral  depends  only  on  the  differences  x-u,  y-v  and  ^dll  be 
the  same  for  every  subpatch.  If  the  number  of  subpatches  is  large  enough, 
then  by  the  central  limit  theorem,  a  zero  mean  Gaussian  random  variable 

(q,  the  same  for  all  patches.  Then 

—I 

t-  ~  (q  ~  ^  where  S  -  ^  , 

'  P  P 

This  is  precisely  the  form  of  an  SIRV.  As  the  radar  beam  moves  in  time, 
either  by  scanning  or  as  it  is  attached  to  an  aircraft,  the  returns  will  be 
samples  of  a  process,  an  SIRP. 

CONCLUSION 

A  radar  clutter  model  derived  frcan  electromagnetic  principles  has  been  sho^m 
to  have  the  proper  form  of  a  spherically  invariant  random  process.  Charac¬ 
terization  of  radar  clutter  as  an  SIRP  frcsn  theory  may  provide  an  important 
link  to  predicting  the  specific  clutter  statistics  based  on  terrain  topology 
and  material  parameters  {yL\,  £)  that  maJce  up  the  surface.  It  should  be  noted, 
however,  that  a  more  general  formulation  of  a  radar  clutter  model  should  in¬ 
clude  volume  scattering  (by  vegetation)  as  well  as  surface  scattering,  and 
also  provide  for  near  normal  incidence  and  near  grazing  incidence. 
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&BSIBACI 

This  is  the  final  report  concerning  results  of  a  susmer  research 
■program  at  Rose  Laboratories.  Four  separate  areas  of  research  were 
investigated.  The  first  areas  focused  on  analyzing  the  mechanisns  of 
photorefraction  in  Czochralski  (Cz)  and  Hydrothermal  Bij|^2^^^20  (®S0) 
grown  at  Rome.  Deep  defects  in  the  bandgap  of  BSO  were  characterized 
and  defect  phenomena  related  to  photorefraction  measurements  in  BSO. 
Electrical  conductivity  was  measured  below  and  above  room  temperature 
for  different  samples  of  this  material.  Above  RT  conductivity  indicates 
a  rapid  increase  in  current  in  BSO  that  is  thermally  activated,  and 
appears  dependent  on  deep  defects.  A  deep  defect  of  1.3  +  0.1  eV  was 
found  and  measured  in  both  Cz  and  Hydrothermal  material.  Below  RT 
thermally  stimulated  current  (TSC)  studies,  however,  indicate  that  the 
concentration  of  traps  of  activation  energy  <0.7  eV  in  the  Hydrothermal 
samples  is  approximately  a  factor  of  10  smaller  than  in  Cz  BSO.  At 
least  five  different  defects  were  i.dentified  in  the  TSC  measurements. 
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Accivacion  energies  for  two  of  these  defects  were  deterained  from 
initial  rise  seasurements  of  TSC  data.  Shile  a  large  concentration  of 
shallow  traps  was  alw^s  found  in  Cz  oaterial.  the  identity  of  the  sost 
dCTinant  shallow  trap  was  not  always  the  sane.  Importance  of  this  data 
on  photoconducting  and  photorefracting  qualities  of  BSO  are  given. 
Suggestions  for  further  studies  are  also  given.  Photorefraction  itself 
was  investigated  as  a  method  of  characterizing  BSO  and  semi -insulating 
InP:Fe.  A  report  of  photorefraction  in  InP:Fe  is  also  given. 

I.  INTRODDCnON 

Photorefraction  is  a  name  given  to  a  non-linear  optical  effect 
observed  in  certain  insulators  and  semi -insulators.  Photorefraction  is 
a  method  of  writing  volume  holograms  which  are  dynamic,  i.e.  volume 
gratings  which  erase  after  cessasion  of  exposure  with  time  constants 
which  vary  from  milliseconds  to  several  months.  Since  it  is  a  non¬ 
linear  effect,  it  holds  promise  for  many  of  the  suitable  applications  of 
non-linear  mixing  including  image  processors;  phase  conjugation  using 
degenerate  four-wave  mixing;  optical  devices  such  as  optical 
oscillators;  and  as  a  technique  useful  for  non- destructive ,  non 
contacting  materials  analysis  [1-4]. 

Photorefraction  has  been  observed  in  several  insulators,  mostly 
ferroelectrics  with  large  electro-optic  coefficients  and  smallr 
dielectric  constants.  In  this  study  photorefraction  and  general  defect 
properties  were  investigated  in  cubic  Bi^2Si02o  (BSO)  and  semi- 
insulating  iron-doped  InP  (InP:Fe).  Introductory  studies  of  BSO  single 
crystals  grown  at  Rome  Laboratories  at  Hanscom  AFB  are  part  of  a  long¬ 
term  investigation  relating  defect  properties  of  BSO  to  its 
photorefractive  properties.  The  empl^asis  of  photorefraction  in  InP:Fe 
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was  twofold:  as  an  attempt  to  demonstrate  photorefraction  in  this 
material  using  low  power  (HeNe)  and  incoherent  (LED)  sources,  and  to 
later  demonstrate  the  technique  as  a  method  of  nondestructive  and 
contactless  testing  of  this  material. 

An  optimum  photorefractive  crystal  must  meet  the  following 
requirements : 

1)  be  fairly  transparent  at  the  wavelength  of  interest, 

2)  have  a  relatively  large  amount  of  deep  defect  centers  in  its 
bandgap, 

3)  possess  a  large  electro-optic  coefficient/dielectric  constant 
ratio,  and 

4)  be  a  relatively  good  photoconductor. 

A  common  theory  given  for  photorefraction  is  the  single -carrier  model 
summarized  mathematically  by  Kukharev  [5].  Two  coherent  pump  beams 
enter  the  sample  with  an  angle  of  2  between  them,  as  shown  in  Fig.  (1) . 
The  single  carrier  model  of  what  happens  as  they  interact  is  shown  in 
Fig.  (2).  Where  the  coherent  pump  beams  mix  they  form  an  intensity 


interference  pattern  of  alternating  light  and  dark  areas  that  can  be 


modeled  as: 


X  fe)  -  Xo  +3 ,  e'  ^  ■  G) 


In  the  high  intensity  regions  of  the  interference  pattern,  the  light  is 
absorbed  in  the  crystal  as  it  alters  the  population  of  deep  traps  in  the 
bandgap.  In  the  single  carrier  model,  the  light  is  absorbed  by  exciting 
electrons  (holes)  to  conduction  band  (or  valence  band) .  The  free 
carriers  diffuse  (or  drift  if  an  external  electric  field  is  applied) , 
to  dark  areas  of  the  interference  pattern  where  they  recombine.  The 
result  is  a  periodic  space-charge  region,  as  shown  in  Fig.  2b.  This 
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space  charge  region  sets  up  a  periodic  electric  field,  the  anqilitude  of 
^ich  (with  no  external  field)  is  determined  as: 

-  -  6  ^JL 

Xo  14 

where  B  ^  —  kgT  ^ 

The  resulting  electric  field  causes  a  periodic  change  in  the  refractive 

IT 

index  from  the  electro-optic  coefficient,  defined  from 

/\y\  cc 

The  resulting  refractive  index  modulation  is  modeled 

The  result  of  the  mixing  of  the  two  pump  beams  is  a  refractive 
index  grating  having  grating  spacing  given  by  Bragg's  Law 

Vs  ~  ^  "A-  sij7  ©  (^S~ 3 

The  result  is  a  refractive  index  grating  of  spacing  iX.  from  which  a  third 
beam  of  light  (referred  to  here  as  the  "probe")  can  diffract  in  a 
similar  way  to  which  a  beam  may  diffract  from  a  fixed  etched  grating. 
The  probe  beam  (shown  dashed  line  in  Fig.  1)  in  this  study  was  always  of 
a  wavelength  larger  than  the  pump  beam,  and  thus  diffracts  at  a  Bragg 
angle  larger  from  the  pump  beam  as  demanded  by  equation  (5).  This  is  the 
so-called  non- degenerate  four-wave  mixing  configuration.  The  probe  beam 
produces  a  diffracted  fourth  beam  which  can  be  used  as  a  means  of 
monitoring  the  grating. 

The  fraction  of  the  probe  beam  diffracted  depends  only  on  the 
relative  magnitude  of  the  pump  beams,  and  not  on  their  absolute 
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magnitude.  Th:>s  relative  magnitude  of  the  puiq>  beams  is  vhat  determines 
the  modulation  index  m  —  . 

The  time  dependence  of  the  index  grating  is  determined  by  [6] : 

1)  Dielectric  relaxation  time, 

2)  Diffusion  length  of  the  photocarriers, 

3)  Debye  screening  length. 

Which  of  these,  or  combination  of  these  parameters  is  important  is 
dependent  upon  the  intensity  of  the  ligjht  and  the  grating  period.  Uhen 
making  a  grating,  the  rise  time  of  the  diffraction  efficiency  increases 
if  total  writing  intensity  increases.  The  writing  and  erasing  rates 
increase  if  the  angle  between  the  pump  beams  increases  [7] ,  Erasing 
times  can  also  be  increased  with  an  erase  beam,  which  offers  a  very 
short  relaxation  time  by  altering  the  photoconductivity  of  the  sample. 

The  significance  of  Eq  (2)  being  imaginary  is  a  1/4  period  phase 
shift  between  the  electric  field  and  the  light  intensity.  This  phase 
shift  can  be  observed  by  comparing  Fig.  2a  to  2d.  This  1/4  period  shift 
allows  one  pump  beam  to  self-diffract  from  the  grating  into  the  other 
beam.  Thus  a  power  transfer  between  the  two  pump  beams  exists,  termed 
beam  coupling.  Beam  coupling  can  be  observed  without  the  use  of  the 
probe  beam.  For  some  ferroelectrics ,  the  magnitude  of  energy  transfer 
in  two-beam  coupling  can  approach  100%,  one  transmitted  pump  beam 
emerges  extinguished  compared  to  the  other. 

If  a  probe  beam  of  the  same  frequency  and  polarization  as  the  pump 
is  used  incident  with  one  pump  beam;  in  so  called  degenerate  four-wave 
mixing,  then  the  fourth  beam  is  diffracted  at  an  angle  along  the 
direction  as  the  other  pump  beam.  Do  to  the  non-linear  nature  of  the 
mixing  in  the  crystal  this  fourth  beam  is  the  phase  conjugate  of  the 


pump  beam  it  follows  along.  Do  to  the  possibility  of  beam  coupling 
explained  above,  i.e.  one  pump  beam  is  diffracted  into  the  other,  a 
reflection  of  one  of  the  pump  beams  back  into  the  crystal  can  be  used  as 
the  third  beam  as  "self  pumped"  phase  conjugation,  that  is,  phase 
conjugation  using  only  one  beam  from  one  laser  [8]. 

In  spite  of  the  simplicity  of  Kukharev's  model,  it  appears  that 
photorefraction  in  BSO  and  InP:Fe  involves  a  more  complex  mechanism.  In 
fact  two-carrier  mechanisms  have  been  suggested  for  both  BSO,  and  InP. 
For  BSO,  it  is  apparent  that  a  hopping  conduction  typical  of  insulators 
is  involved  [9].  Hopping  takes  place  when  carriers  move  between 
localized  centers  in  the  bandgap. 

Photorefraction  is  unique  in  that  creating  the  non-linear  effect 
does  not  require  high  intensity  sources.  This  is  because  the  grating 
modulation  index  m  does  not  depend  on  the  absolute  intensity  of  the 
pump-beams,  but  rather  on  the  ratio  of  pump  beam  intensities.  Thus  it 
is  conceivable  that  in  the  right  materials,  it  could  be  observed  using 
available  low-power  coherent  sources.  The  speed  of  the  effect  does, 
however  depend  on  light  intensity. 

In  this  study  several  samples  of  BSO  grown  by  Czochralski  (Cz)  and 
Hydrothermal  techniques  were  studied  in  an  attempt  to  compare  their 
material  defect  properties  to  the  photorefractive  efficiency  and  time 
constants  of  the  materials.  Four  Cz  samples,  and  one  hydrothermal 
sample  were  measured.  These  samples  were: 

BSO  26  6 -9s  purity  Cz  grown 

BSO  45  5 -9s  purity  Cz  grown 

BSO  55  Cz  grown  from  a  low-purity  Hydrothermal  charge 

BSO  72  5 -9s  purity  Cz  grown  in  Bi203  rich  conditions 

BS  40  Hydrothermal  grown 

BSO  51  5 -9s  purity  Cz  irradiated  with  50Mrad  gamma  radiation 

Itek  Commercial  BSO  sample 
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All  samples  except  BS  40  were  yellowish  in  color.  Sample  BSO  72  had 

several  inclusions  and  was  thus  unsuitable  for  photorefraction  studies. 

All  samples  except  BSO  26  and  commercial  sample  were  cut  and  polished  at 

Rome  for  photorefraction  measurements.  Separate  samples  were  used  for 

hi^  temperature  conductivity  and  low  temperature  thermally  stimulated 

current  (TSC)  measurements.  Photorefraction  measurements  were  made 

using  non- degenerate  four  wave  mixing  to  measure  diffraction 

efficiencies  and  grating  decay  times.  Preliminary  results  on 

photorefraction  in  InP  using  infrared  pumps  and  probes  is  also  reported. 

Each  section  will  be  reported  separately. 

II,  HIGH  TEMPERATURE  ELECTRICAL  CONDUCTIVITY 

Electrical  conductivity  measurements  of  BSO  above  room  temperature 

was  attempted  so  that  trap  or  traps  having  thermal  activation  energies 

greater  than  0.7  eV  could  be  thermally  emptied  and  thus  characterized. 

II. A.  HIGH  TEMPERATURE  CONDUCTIVITY  EXPERIMENTAL  PROCEDURE. 

Measurements  were  made  in  the  dark  on  samples  from  8j0  to  450  C. 
2  3 

A  field  of  10  to  10  V/cm  was  applied  constantly  for  the  entire  heat  and 
cool  cycle,  and  D.C.  current  was  monitored  using  a  picoammeter.  A  five 
hour  linear  heat  rate  between  85  C  and  450  C  was  followed  by  a  three 
hour  soak  at  450*C  and  then  a  five  or  eight  hour  cool  time.  Samples 
were  contacted  at  two  points  using  either  indium  or  graphoil  contacts. 
Prior  to  measurements,  indium  contacts  were  pressed  and  then  heated  on  a 
hot  plate  until  the  indium  wetted  the  sample  surface. 

II. B.  HIGH  TEMPERATURE  CONDUCTIVITY  RESULTS. 

Obtaining  measurements  of  the  high  temperature  conductivity  of  BSO 
was  complicated  somewhat  by  the  rapid  onset  of  space  charge  effects. 
Space  charge  effects  were  not  observed  below  400 "c  at  the  low  elfc.^ric 
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fields  used,  but  between  300  and  400*C,  measured  DC  currents  for  BSO 
samples  would  increase  dramatically,  then  saturate  when  450 was 
reached.  A  typical  scan  is  shown  schematically  in  Fig.  3.  Upon 
cooling,  the  current  would  exhibit  a  hysteresis,  and  not  return  to  its 
initial  value  for  several  hours.  At  this  point,  the  sample  would  be 
electrically  stressed,  or  polarized.  Upon  removal  of  the  electric  field 
at  85*,  a  reverse  polarization  current  of  several  tens  of  picoamps  would 
flow  for  an  indeterminable  amount  of  time. 

This  reverse  current  could  be  observed  for  all  samples  tested, 
including  hydrothermal  BSO.  For  some  samples,  including  the 
hydrothermal  grown  sample,  this  reverse  current  could  be  temporarily 
quenched  with  illumination  of  an  unfiltered  low-power  tungsten  light 
source.  Current  would  resume  after  cessation  of  the  illumination.  Due 
to  the  speed  of  this  photoconductive  quenching,  it  was  concluded  that 
the  polarization  was  electronic  and  not  ionic  in  nature. 

One  of  the  more  noticeable  properties  of  the  data  in  Fig.  3  is  its 
reproducibility  in  an  unstressed  sample -and  reproducibility  from  sample 
to  sample.  Due  to  the  dramatic  (four  orders  of  magnitude)  rise  in 
current  through  the  same  temperature  range,  it  is  likely  that  there  are 
deep  traps  emptying  contributing  carriers  which  in  turn  build  a  space 
charge  effect.  If  an  unstressed  sample  is  measured,  the  initial  rise 
data  from  Fig.  3  can  be  plotted  in  an  Arrhenius  plot  and  the  activation 
energy  determined.  Figure  four  shows  the  Arrhenius  plots  of  a  Cz  and  a 
Hydrothermal  sample.  Activation  energies  of  1.3  and  1.1  eV,  were 
determined,  respectively,  from  a  linear  regression  analysis  of  the  data. 
Oberscmid  [11]  also  observed  an  activation  energy  of  1.1+  0.07  eV  on 
unstressed  BSO  samples.  Determining  activation  energies  from  this 


technique  at  these  tejnperatures  is  hampered  by  the  possibility  of  charge 
retrapping.  This  re  trapping  is  also  more  dominant  if  the  trap  of 
interest  is  only  partially  filled,  thus  some  variation  in  activation 
energies  is  to  be  expected. 

It  is  interesting  to  note  that  similar  deep  defects  were  observed 
from  the  high  temperature  measurements  in  both  hydrothermal  and  Cz  BSO, 
suggesting  there  does  exist  deep  defects  in  the  hydrothermal  material 
which  can,  at  least,  be  thermally  emptied.  This  does  not  explain  why 
such  traps  are  not  observed  by  optical  absorption.  It  is  possible  that 
such  traps  may  not  fill  unless  slightly  heated  above  room  temperature. 
Tills  hypothesis  may  be  supported  by  the  fact  that  clear  hydrothermal 
samples  were  observed  to  darken  when  heated  on  a  hot  plate  under  ambient 
conditions . 

Electrical  stressing  has  been  observed  by  Efendiev,  et  al  [10]  in 
BTO  and  by  Oberschmid  [11].  Oberschmid  determined  that  stressing  was 
caused  by  regions  of  a  negative  space  charge  which  slowly  builds  up  as 
electrons  are  injected  from  the  cathode  and  eventually  the  space  charge 
region  shunts  the  electrodes.  Efendiev  observed  that  in  BTO  the  effect 
could  last  for  several  months  in  the  dark.  No  attempt  to  determine  time 
constants  of  electrical  stressing  was  attempted  in  this  study. 

Sample  BSO  45,  after  ,  cycles  between  85  and  450  at  10^  V/cm, 
was  visibly  observed  to  have  bleached  viue  to  the  treatment.  Sample  BSO 
26  was  not  observed  to  discolor,  however  was  a  lighter  color  initially 
than  the  other  Cz  samples.  It  was  concluded  that  the  treatment  either 
removed  an  impurity  in  BSO  45  not  in  BSO  26 ,  or  else  indium  from  the 
contact  was  drifted  into  the  sample.  As  Cz  BSO  intentionally  aluminum 
or  gallium  doped  at  Rome  is  obseirved  to  bleach,  it  may  be  expected  that 
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In  would  have  the  same  effect. 


II.  C.  SUGGESTIONS  FOR  FUTURE  WORK  ON  HIGH  TEMPERATURE  ELECTRICAL 
CONDUCTIVITY 

It  is  felt  that  the  high  temperature  current  measurements  could  be 
taken  with  fields  below  10  V/cm,  to  further  minimize  the  above  mentioned 
space -charge  effects.  A  much  lower  field  and  lower  heat  rate  would  help 
in  determining  the  thermal  activation  energies  of  any  traps 
participating  in  the  effect. 

It  was  observed  that  hydrothermal  samples  could  darken  when  heated 
above  approximately  150® C.  Deep  defects  could  account  for  this  change 
and  above  temperature  optical  measurements  of  this  material  could  be 
appropriate . 

III.  THERMALLY  STIMULATED  CURRENT  STUDIES  OF  BSO. 

In  order  to  characterize  the  more  shallow  defects  in  BSO  TSC 
measurements  were  carried  out.  Linear  heat  rates  could  be  obtained  from 
77K  up  to  approx  200  K, 

III. A.  TSC  PROCEDURE. 

Samples  approximately  1cm  X  1cm  X  3mm  were  contacted  with  rolled 
indium  1mm  X  1cm  strips  along  the  edges  of  one  face.  These  contacts 
were  heated  15  min  in  air  on  a  hot  plate  until  the  indium  wetted  the 
surface.  The  samples  were  mounted  in  a  cryostat  using  rubber  cement. 
The  samples  were  cooled  in  the  dark  with  no  field.  Once  liquid  nitrogen 
temperature  had  been  reached,  a  field  of  100  V/cm  was  applied  and  the 
sample  was  illuminated  with  unfiltered  light  from  a  200  watt  Hg  arc  lamp 
in  order  to  create  free  carriers  to  fill  traps.  Various  illumination 
times  from  2  min  to  15  minutes  were  used  with  no  great  observable  effect 
on  the  resulting  TSC  spectra.  Ten  minute  illumination  times  were  more 
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typical.  After  illumination,  the  samples  were  then  warmed  in  the  dark  at 
a  heat  rate  of  1.5  C/hr. 

III.B.  TSC  RESULTS 

Typical  results  of  TSC  scans  are  shown  in  Fig.  5.  Three  groupings 
of  peaks  are  normally  observed  for  each  sample.  Fig.  5a  shows  at  least 
two  peaks  exist  in  group  labeled  "A".  Peak(s)  A  is  a  major  shallow 
peak  of  significant  concentration  which  was  observed  for  most  Cz 
samples . 

A  rough  estimation  of  trap  concentration  can  be  determined  by 
assuming  all  traps  were  filled  by  the  filling  illumination  and  then 
measuring  the  integrated  charge  under  each  peak.  Results  for  several 
BSO  samples  show  in  in  Table  I. 

Table  I.  Measured  integrated  charge  for  the 
peaks  found  in  TSC  measurements  of  BSO  samples. 

All  concentrations  are  in  cm*^. 


Sample 

Peak  A 

Peak  B 

Peak  C 

Peak  D 

BSO  45 

BSO  72 

BSO  51  Irrad 

BSO  55 

Itek 

1.1X10^^ 

7X10^"* 

4.4X10^^ 

N/A* 

6 , 7X10^^ 

10l2 

loJ-2 

10^2 

3X10^^ 

2.|JX10^^ 

“u 

5X10jf 

4X10^^ 

10^^ 

10“ 

10^2 

1A 

2X10^^ 

Hydrothermal 

9X10^° 

N/A 

*N/A  means  the  peaks  were  not  observed  for  these  samples. 

In  Table  I,  it  is  apparent  that  a  dominant  shallow  (<0.3eV)  trap 

13  3 

of  concentration  greater  than  5  X  10  cm*  was  observed  in  all  Cz  grown 
BSO.  However,  comparing  Sample  BSO  45  and  BSO  55,  which  was  grown  from 
an  impure  hydrothermal  charge.  It  is  evident  that  the  dominant  shallow 
trap  is  different  for  these  two  samples.  TSC  spectra  for  BSO  45  and  55 
are  shown  in  Fig.  5.  It  was  observed  that  both  of  these  samples  were 
yellowish  and  were  also  both  photorefracting. 

In  Table  I  and  Fig.  5b,  it  is  observed  that  a  significantly  lower 
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concentration  of  traps  was  found  in  the  hydrothermal  sample,  although 
peaks  A  through  C  were  observed.  This  sample  was  not  photorefracting . 

In  section  II  it  was  observed  that  a  deep  center  was  observed  in 
all  BSO  samples,  both  Hydrothermal  and  Cz  grown.  It  is  interesting  that 
samples  which  are  photorefracting  also  have  large  concentration  of 
shallow  levels,  however  it  does  not  seem  important  exactly  which  shallow 
levels  are  present,  as  long  as  at  least  5X10  cm  are  indeed  present. 
There  may  be  a  connection  between  shallow  defect  concentration  and  the 
optical  absorption  of  BSO.  If  there  are  not  enough  shallow  traps  in  the 
Hydrothermal  material,  deep  defects  present  may  be  above  the  Fermi  level 
and  are  thus  empty  at  room  temperature  and  are  noi.  emptied  by  room 
temperature  optical  absorption.  These  traps  could  therefore  not 
participate  in  photorefraction. 

From  the  initial  current  rise  of  Peak  A  in  sample  BSO  45  and  Peak 
B  in  sample  BSO  55,  thermal  activation  energies  were  determined  as 
described  by  Milnes  [12].  Initial  slope  of  the  data  was  fitted  to 

For  these  measurements,  the  TSC  scans  were  repeated  at  higher  gains 
compared  to  those  used  to  record  figure  5  in  order  to  obtain  accurate 
readings  of  the  initial  rise  in  thermal  currents.  Arrhenius  plots  for 
these  two  peaks  are  shown  in  Fig.  6.  Activation  energies  determined 
from  a  least-squares  analysis  gave  an  activation  energy  of  0.17  eV  for 
Peak  A  and  0.215  eV  for  Peak  B. 

It  is  also  interesting  to  note  in  Table  I  that  the  Itek  sample  has 
higher  integrated  charge  under  Peaks  B  through  D  as  compared  to  those 
grown  in  Rome. 

It  was  also  observed  that  BSO  55  was  not  a  good  photoconductor 
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under  the  un- filtered  Hg  light,  at  any  temperature.  However  sample  BSO 
55  is  still  a  reasonable  photorefractor,  as  observed  in  Section  IV. 
Thus  a  study  relating  photo con -activity  to  efficiency  and  speed  of 
photorefraction  is  probably  warranted.  The  effects  of  photoconductivity 
quenching  via  the  tungsten  light  as  observed  above  may  have  some  effect 
on  photorefraction.  It  has  been  observed  that  infrared  illumination 
does  increase  photorefraction  efficiencies  [14], 

III. C.  SUGGESTIONS  FOR  FURTHER  TSC  WORK. 

The  ease  of  the  TSC  measurements  have  indicated  that  this  could  be 
a  good  technique  for  monitoring  BSO  samples  grown  at  Rome.  It  is  felt 
that  a  definite  correlation  could  be  made  between  the  purity  of  the 
starting  Cz  charge  and  the  type  and  height  of  the  TSC  peaks  observed. 
It  may  prove  that  in  order  to  make  hydrothermal  samples  photorefracting, 
shallow  impurities  may  have  to  be  added  to  change  the  way  charge  is 
compensated  by  deep  states.  A  lack  of  shallow  impurities  may  move  the 
Fermi  level  in  such  a  way  as  to  empty  deep  traps  that  are  normally 
filled  with  electrons  in  Cz  material. 

A  cryostat  suitable  for  TSC  needs  to  be  developed.  Such  a 
cryostat,  where  heat  rates  are  more  linear,  can  be  warmed  through  room 
temperature,  and  where  heat  rates  can  be  varied,  is  needed  to  determine 
more  accurate  trap  activation  energies. 

IV.  PHOTOREFRACTION  OF  BSO. 

The  samples  were  next  measured  for  photorefraction  efficiency,  as 
measured  by  non-degenerate  four  wave  mixing,  and  grating  time  constant. 
These  measurements  were  undertaken  to  determine  the  feasibility  of  using 
photorefraction  to  characterize  BSO  grown  at  Rome  Labs. 
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IV. A.  ESOICHESEACnffiS 


A  schematic  of  piiotorcfeaccioa  sysceg  is  sasowa  isa  Fig.  5.  For 
BSO  m»a«aaT'«»!iT\yTnir«; ,  yjie  p*rirp  beam  vss  a  A6S  t?™  liae  fro3s  aa  Airgoa  laseir 
^TtA  a  633  ms  HeSe  beass  vas  used  for  rbe  probe.  Ibe  paoip  beams  Co  cbe 
sarple  fross  tiie  Argci>  laser  were  created  fror  a  cube  beam  splitter.  Ibe 
sample  vas  placed  in  the  crossed  pmip  beams  wisicb  made  an  angle  of  14 
between  one  another.  The  probe  beam  from  the  HeSe  was  bounced  from 
airror  "A"  onto  the  sample.  A  target  was  used  to  mark  the  relative 
positions  of  pu:^  beams  and  probe  beam.  The  target  was  first  placed  in 
the  puTgp  beams,  at  a  distaiure  of  12  inches  in  front  of  the  satnile,  and 
the  positions  where  the  pump  beams  intersected  the  target  was  narked. 
The  target  position  (12  inches)  is  also  the  position  ihere  nirror  “A*  is 
placed.  The  Bragg  angle  for  the  pump  beans  can  be  calculated  from 
measuring  the  distance  between  pump  beans  marked  on  the  target.  The 
Bragg  angle  for  the  probe  can  next  be  calculated  and  its  relative 
position  on  the  target  can  be  narked.  This  is  the  point  idiere  the  probe 
beam  will  incident  the  mirror.  The  HeNe  laser  is  mounted  such  that  the 
incident  spot  on  mirror  "A“  could  be  x-y  translated.  The  HeNe  beam  is 
translated  until  the  red  light  meets  the  mirror  at  the  point  marked  on 
the  target.  This  would  bring  the  probe  within  a  few  degrees  of  its 
Bragg  angle,  and  with  minor  adjustments  of  mirror  "A"  from  that  point 
the  diffracted  beam  could  be  maximized.  A  1.35^m  probe  beam  from  a  44 
microwatt  LED  was  attempted  but  was  difficult  to  observe 
photodiffraction  due  to  the  large  Bragg  angle.  At  a  large  angle,  the 
errors  on  the  target  are  large,  and  the  probe  beam  is  easily  moved  off 
the  sample  with  even  minor  corrections  of  mirror  "A"  to  the  incident 
angle.  Both  pump  and  probe  lasers  were  vertically  polarized. 
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For  efficiency  and  rise  consrsnc  seasurenencs ,  Che  diffracted 
probe  was  scoaicored  with  a  silicon  photcdetector.  The  incident  probe 
bean  was  modnlated  at  30  Hz  and  a  lock-in  detection  system  was  used. 
Keucral  density  filter  of  O.D.  —  2  or  4  was  iiiserced  in  the  incident 
probe  beam  to  insure  that  the  probe  would  not  erase  the  grating  and  that 
the  silicon  detector  wotild  not  be  exposed  to  the  hig^i  irradiance  of  the 
diffracted  bean. 

I9.B.  mxjxsEsz&cnos  hesdlts. 

Efficiency  neasureoencs  were  roughly  the  same  for  all  saasples,  at 
2.3  X  10^  for  5id?  probe  altenuated  by  O.D.  -  2  filter.  Qhen  the  sample 
was  rotated  90®,  efficiencies  were  25«  hi^er  in  one  orientation  than 
the  ocher.  However,  these  measurements  are  not  satisfactory  for  the 
reasons  stated  below. 

Fhotorefraccion  measurements  were  hampered  by  several  problems 
which  made  reproducibility  of  measurements  difficult.  Reliable 
measurements  could  only  be  taken  when  the  system  was  in  perfect 
alignment.  Since  different  efficiencies  are  observed  for  the  two 
different  grating  vectors  that  are  possible  in  cubic  cyrstals,  the 
samples  must  be  rotated  and  measured  for  both  directions.  Since  only 
two  samples  were  cut  on  all  six  sides,  a  rotator  had  to  be  constructed 
which  would  rotate  a  sample.  Unfortunately,  the  rotator  also  moved  the 
sample  in  and  out  of  where  the  beams  crossed,  and  subsequently  once 
rotated  the  system  had  to  be  realigned.  Time  restraints  did  not  allow 
for  the  construction  of  a  better  rotator.  A  rotator  needs  to  be 
designed  which  would  allow  rotation  of  the  sample  about  a  fixed  point 
without  changing  the  angle  of  incidence  to  the  sample  surface. 

The  second  more  serious  problem  for  efficiency  measurements  is  the 
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effect  of  the  probe  beam  on  erasing  the  grating-  Probe  beam  of  633  nm, 
vben  attenuated  by  increasing  O.D.  filters,  gave  increasing  diffraction 
efficiencies.  With  the  probe  attenuated  by  O.D,  -  4,  the  efficiencies 
would  go  up  a  factor  10.  With  O.D.  greater  than  4  the  diffracted  beam 
could  not  be  obseinred  with  the  detection  system  used. 

Tine  constants  for  three  samples  shown  in  Fig.  8.  These  decays 
were  measured  in  the  dark.  Incident  probe  intensity  was  attenuated  by 
O.D.  —  2  filter  for  these  measurements.  However,  these  measurements 
suffer  the  same  problems  as  efficiency  measurements,  meaning,  the  effect 
of  the  probe  beam  in  erasing  the  grating  cannot  be  accounted  for.  If 
the  probe  is  attenuated  to  very  low  levels,  then  the  ambient  light  in 
the  room  would  erase  the  grating,  and  the  ambient  light  is  not 
reproducable  for  day  to  day  measurements. 

IV.  SUGGESTIONS  FOR  FURTHER  WORK. 

It  is  felt  that  the  effects  of  the  probe  intensity  could  be 
negated  in  the  time  constant  measurements  in  the  above  experiments  if 
the  grating  was  purposely  erased  with  light  of  known  wavelength  and 
intensity.  However,  the  time  constants  in  doing  so  are  shorter  than  the 
response  time  of  the  system  used  in  the  above  measurements.  In  the 
experiments  above,  the  time  constant  of  the  measurement  system  was 
determined  by  the  output  filter  of  the  lock-in,  or  the  highest  speed  of 
the  strip  chart  recorder  used,  typically  Im  sec. 

The  erasure-under-light  time  constant  will  depend  on  the 
photoconductive  dielectric  response  time  of  the  sample  when  exposed  to 
that  light,  which  is  a  material  parameter  and  thus  may  give  some 
information  concerning  material  quality. 

Efficiency  measurements  could  be  made  with  the  stronger  488  nm 
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pump  and  an  infrared  probe  which  is  not  abosrved  as  strongly  in  BSO  as 
the  red.  If  a  very  sensitive  detection  system  were  used,  efficiency 
could  be  measured  with  increasing  attenuation  of  the  probe  until  the 
measured  efficiencies  saturated,  and  thus  the  effects  of  the  probe  could 
be  discounted. 

V.  PHOTOREFRACTION  OF  InP. 

Photorefraction  experiments  were  attempted  on  InP:Fe  grown  at  Rome 
labs .  Samples  1  X  1  X  0 . 8  cm  were  cut  and  two  faces  along  the  short 
direction  were  polished.  These  faces  were  [110] ,  the  other  directions 
being  [110]  and  [100].  An  electric  field  of  3kV  was  applied  across  the 
[100]  direction.  For  this  purpose  ohmic  contacts  of  thin  indium  were 
rolled  onto  the  sample  sides  and  annealed  at  400  C  for  1.5  minutes. 

The  pump  beam  in  the  InP  experiments  was  a  1.15 yum  2  mW  HeNe 
laser.  The  probe  used  was  a  44 1.35  jfla  LED.  The  LED  was  collimated 
using  a  XIO  objective  lens.  The  laser  was  expanded  using  a  XIO 
objective,  and  2  inch  F.L.  convex  lens  and  spatially  filtered.  The  same 
set  up  for  photorefraction  of  BSO  was  used.  The  same  cube  beam  splitter 
and  mirrors  were  used  for  InP  as  for  BSO  photorefraction. 

V.B.  RESULTS  AND  SUGGESTIONS  FOR  FUTURE  WORK. 

Unfortunately,  photorefraction  was  never  observed  in  the  InP 
samples.  Difficulties  in  aligning  infrared  beams  were  encountered.  A 
serious  problem  with  the  InP  is  not  knowing  the  correct  material 
parameters  for  photorefraction.  This  is  a  basic  "chicken  and  egg" 
problem.  Optical  density  of  several  InP  samples  varied  from  0.6  to  1.0 
for  roughly  the  same  sample  thickness,  and  the  relation  between 
absorption  and  defect  quantity  in  InP  is  not  that  clear  cut. 

It  is  suggested  that  a  sample  of  InP  that  is  a  known 
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photorefractor  be  obtained.  With  this  sample,  the  low-power  system  used 
in  this  study  could  be  tested.  If  photorefraction  were  successful  at 
this  power  level  with  this  sample,  samples  grown  at  Rome  could  then  be 
tested.  Otherwise,  many  samples  grown  at  Rome  would  have  to  be  tested, 
initially  with  higher  power  lasers  than  those  used  in  this  study. 
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1.  Schematic  of  the  non-degenerate  four-wave  mixing  experiment  involved 
in  observing  the  photorefractive  effect. 

2.  Model  illustrating  the  photorefractive  effect  using  the  single¬ 
carrier  model. 

3.  Schematic  of  temperature  &  current  for  above  room- temperature 
electrical  conductivity  measurements,  showing  hysteresis  in  current. 

4.  Arrhenius  plot  of  the  initial  rise  of  high  temperature  conductivity 
for  different  BSO  samples.  Cross-sectional  areas  were  the  same  for  all 
samples  at  0.25  cm^. 

5.  TSC  measurements  of  a)  Cz  BSO  sample  BSO  45,  and  b)  Hydrothermal 
sample  BS  40,  c)  Cz  BSO  sample  BSO  55,  grown  from  Hydrothermal  charge. 

6.  Arrhenius  plots  of  the  initial  rise  of  TSC  peaks  "A"  of  sample  BSO 
45  and  "B"  BSO  55. 

7.  Schematic  of  Photorefraction  experiment, 

8.  Dark  photorefraction  time  constants  for  samples  commercial  BSO  and 
BSO  26.  Each  decay  is  repealed  with  the  sample  rotated  90  . 
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A  methodology  was  developed  for  ttie  task  of  automaticaBy  extracting  semantic 
information  from  dictionary  deiintions  by  means  of  Relation  Revealing  Formulae  (RRF) 
whirA  ate  based  on  lerdcal,  syntactic  and  semantic  regularities  in  Longman's  Dictionary 
of  Conterr^rary  English  and  whidi  make  use  of  the  additional  infonnation  provided  on 
the  madiine  'eadaUe  tape.  Those  verb  definitions  which  use  to  make*  comprise  the 
sample  on  whk^  the  generic  Constant  Comparative  research  paradigm  was  applied  for 
ttte  development  of  the  spedfic  steps  of  a  new  methodology  for  extracting  semantic 
relations.  RRF  wriiidi  successfully  disamb^uate  the  senses  of  'make'  as  used  in 
definitions  were  developed  and  a  plan  for  further  testing  of  the  RRF  and  refinement  and 
a|^)lkrtion  of  tire  methodology  is  proposed. 

1 .  Introduction 

The  goal  of  my  summer  research  conduded  at  Rome  Labs  was  to  further  develop  and 
refine  a  methodology  for  extracting  semantic  knowledge  from  a  machine-readable 
didionary  (MRD).  The  methodology  should  have  generalizability  to  the  extraction  of 
semantic  knowledge  from  other  large  textual  databases.  The  semantic  representation  of 
the  dictionary  definitions  which  this  methodology  would  produce  could  be  used  in  two 
very  different  types  of  applications. 

Firstly,  the  methodology  provides  a  reasonable  approach  to  the  task  of  developing 
reasonably-sized  lexicons  for  NLP  systems.  The  creation  of  semantic  representations  of 
texts  is  a  major  hurdle  to  be  overcome  in  the  development  of  text-based  intelligent 
systems  (TBIS),  whether  these  systems  be  traditional  information  retrieval  systems  or 


whether  they  be  expert  systems  making  use  of  naturally  occurring  texts  as  one  of  their 


8-1 


knowledgebases.  The  lack  of  reasonably  sized,  semantically  rich  lexicons  is  posing  one 
of  the  major  bottiened(S  in  TBiS  development  (Lenat  et  al,  1986).  This  dearth  of 
lexicons  is  due  to  the  absence  of  a  method  for  the  efficient  acquisition  of  semantic 
knowledge  of  the  world.  The  automatic  or  semi-automatic  acquisition  of  the  lexicon  is 
considered  a  critical  factor  in  determining  how  widespread  the  use  of  natural  language 
processors  will  be  In  the  next  few  years  (Velardi  et  al,  1991).  The  knowledge  for  use  In 
such  systems  must  be  acquired  and  represented  in  such  a  manner  that  its  implicit 
semantics  are  then  available  for  the  necessary  inferencing  of  the  host  system.  Knowledge 
acquisition,  carried  out  by  knowledge  engineers,  is  the  most  commonly  used  approach  to 
acquiring  this  knowledge,  but  it  has  proven  time-consuming  and  difficult  to  contain.  The 
methodology  which  I  have  been  developing  this  summer  provides  one  plausible 
alternative  to  this  inefficient  process,  it  is  called  knowledge  extraction  (Kwasnik,  Liddy 
&  Myaeng,1989),  and  is  a  means  whereby  pre-existing  textual  sources  are  processed  in 
as  automatic  a  manner  as  possible  with  the  goal  of  extracting  the  knowledge  encoded  in 
natural  language  repositories. 

Secondly,  the  semantic  knowledge  which  this  methodology  would  extract  from  the 
MRD  could  be  used  to  produce  a  humanly  "explorable  vocabulary"  consisting  of  nodes  and 
relationally-labelled  arcs  for  use  as  a  browsing  tool  for  individuals  who  are  attempting 
to  learn  about  a  new  domain  of  knowledge.  Usua"y,  such  initial  learning  is  disadvantaged 
by  an  individual's  relative  unfamiliarity  with  the  semantic  structure  of  the  concepts  in  a 
new  field  of  interest.  One  could  envision  a  graphical  semantic  representation  (e.g. 
semantic  network)  being  a  useful  tool;  presenting  the  concepts  of  the  field  and  specifying 
the  semantic  relations  among  them.  This  explorable  vocabulary  could  serve  to  clarify  an 
individual’s  original  notions;  to  provide  appropriate  terminology  for  further 
investigation  of  the  topic;  and  to  suggest  richer  connections  between  concepts  which  had 
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not  yet  occurred  to  the  novice. 

Machine  Readable  Dictionaries  (MRDs)  form  an  excellent  and  readily  available 
textual  source  for  performing  semantic  knowledge  extraction  because  MRDs:  1)  are 
culturally  validated  sources  of  much  of  our  commonplace  knowledge  of  the  world  as 
accepted  by  native  language  speakers  over  many  years;  2)  provide  raw  data  which  is 
unbiased  by  a  particular  individual's  beliefs,  views,  knowledge  or  experience,  and;  3) 
span  a  wide  range  of  subject  areas,  thereby  providing  a  shallower,  but  more  complete 
coverage  of  the  language  than  would  be  available  in  other  corpora. 

For  my  research,  I  am  using  the  1987  version  of  Longman’s  Dictionary  of 
Contemporary  English  (LDOCE).  The  1977  version  of  this  MRD  has  been  the  object  of  a 
substantive  body  of  work  (Alshawi,  1987;  Boguraev  &  Briscoe,  1987;  Wilks  et  al, 
1987)  which  has  revealed  the  potential  richness  of  the  data  encoded  on  it.  LDOCE  is 
particularly  appropriate  for  my  work  because  it  was  developed  to  be  used  as  a  learner’s 
dictionary  by  non-native  English  speakers  and,  to  accommodate  this  use,  is  based  on  a 
defining  vocabulary  of  approximately  2000  words.  Most  of  the  definitions  are  written 
using  only  these  terms  and  any  word  used  in  a  definition  that  is  not  on  this  list  is 
indicated  by  a  type-font  code.  This  controlled  defining  vocabulary  contributes  to  the 
tractability  of  my  current  research. 

2.  Background 

The  current  research  is  based  on  earlier  investigations  I  have  conducted  into  the 
nature  and  content  of  the  definitions  in  machine  readable  dictionaries,  theories  of 
semantic  relations,  sublanguage  analysis  and  some  joint  planning  of  a  humanly 
explotable  vocabulary  (Kwasnik,  Liddy  &  Myaeng,  1 989)  and  requires  some  basic 
explanation  of  those  areas. 
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Meaning  is  commonly  defined  as  consisting  of  concepts  and  the  relations  among  these 
concepts.  In  the  English  language,  concepts  are  most  frequently  expressed  in  the  form  of 
noun  phrases  while  relations  among  concepts  are  commonly  expressed  by  either  verb 
phrases  or  prepositions.  There  are,  of  course,  exceptions  to  these  regularities,  but  the 
linguistic  practice  is  consistent  enough  for  its  assumption  to  permit  development  of  a 
regularized  approach  to  the  automatic  extraction  of  meaning  from  English  text. 

The  concepts  present  in  dictionary  definitions  are  easier  to  recognize  automatically 
than  the  semantic  relations  that  exist  among  concepts,  because  the  language  used  in 
dictionary  definitions  only  implicitly  reveals  these  relations.  For  example,  a  definition 
may  not  expressly  state  "the  purpose  of  X  is  Y",  but,  according  to  my  hypothesis,  there 
is  a  delineable  set  of  phrases  such  as  "X  is  an  instrument  for  Y-ing"  in  which  the  phrase 
"is  an  instrument  for"  reveals  the  PURPOSE  relation  between  X  and  Y.  I  define  relations 
as  properties  that  hold  between  two  or  more  entities.  The  entities  may  be  people,  events, 
objects,  situations,  characteristics,  actions,  places,  values,  etc.  Relations  define  the 
nature  of  the  interaction,  dependency,  influence  or  simply  co-occurrence  which  holds 
between  the  entities. 

Although  semantic  relations  have  been  of  research  interest  in  a  variety  of  disciplines 
such  as  psychology,  cognitive  science,  ethnography,  and  linguistics  (Evens,  1988),  no 
consensus  has  been  reached  on  a  definitive  set  of  semantic  relations.  I  have  been 
proceeding  inductively  to  uncover  all  the  semantic  relations  present  in  dictionary 
definitions,  based  on  the  belief  that  the  most  appropriate  set  of  relations  cannot  be 
determined  a  priori,  but  is  discoverable  within  the  corpus  being  analyzed  (White, 

1988).  Empirical  tests  of  applications  using  the  representations  created  from  my 
analyses  will  reveal  both  the  coverage  and  the  utility  of  the  set  of  relations  developed. 


22  Semantic  Relations  in  Definitions 


Figure  1  provides  some  examples  of  how  semantic  relations  are  expressed  in  LDOCE 
definitions.  The  terms  foilowing  the  parentheses  exist  in  relation  to  the  term  being 
defined,  in  the  relation  stated  in  capital  letters  within  the  parentheses.  The  name  of  the 
relation  within  the  parentheses  was  added  by  me.  The  phrases  in  each  definition 
presented  in  bold  print  are  the  lexical  clues  used  in  the  definition  which  suggest  the 
nature  of  that  particular  relationship.  For  example,  in  the  definition  of  diet,  the  GOAL 
relation  exists  between  diet  and  'become  thinner*  and  is  indicated  by  the  phrase  in 
order  to. 


depress  -  to  make  [CAUSAL]  less  active  or  strong 
smooth  -  having  [CHARACTERISTIC]  an  even  surface 
pitcher-  a  player  who  [AGENT]  throws  the  ball  toward  the  batter 
stab  -  to  strike  forcefully  with  [INSTRUMENT]  the  point  of  something  sharp 
diet  -  to  eat  according  to  a  special  diet,  esp.  in  order  to  [GOAL]  become  thinner 
shoot  -  an  area  of  land  where  [LOCATION]  animals  are  shot  for  sport 
naii  file  -  a  small  Instrument  with  a  rough  surface  for  [PURPOSE]  shaping  finger 
nails 

Fig.  1 :  Sample  Definitions  from  LDOCE 


2.3  _  Sublanguage  Analysis 

The  linguistic  basis  of  this  approach  to  automatic  extraction  of  knowledge  from  MRD 
derives  from  sublanguage  theory  (Sager  et  al,  1987)  which  has  shown  that  within  a 
particular  text-type,  predictable  syntactic  and  semantic  regularities  develop  over  time, 
and  these  regularities  are  used  consistently.  Sublanguages  develop  amongst  a  group  of 
language  users  who  share  a  common  purpose  and  who  must  repeatedly  produce  instances 
of  a  particular  type  of  text  (Kittredge  &  Lehrberger,  1982;  Grishman  &  Kittredge, 
1986).  Since  lexicographers  are  a  specialized  group  working  on  a  common  task,  the 
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linguistic  features  of  dictionary  entries  can  be  analyzed  using  sublanguage  methodologies 
which  have  shown  themselves  successful  in  other  text-types  consisting  of  brief,  fairly 
standardized  utterances  (Liddy  et  al,  1989;  Liddy  et  al,1991). 

Dictionaries  use  relatively  conscribed  ways  to  convey  a  definition.  This  fact  has  been 
noted  and  exploited  by  other  lexical  researchers  who  refer  to  the  "words  or  phrases  that 
are  used  frequently  in  definitions"  (Ahlswede  &  Evens,  1 988)  as  defining  formulae 
(Smith,  1981;  Ahlswede,  1985).  My  work  builds  upon  these  earlier  efforts,  but 
distinguishes  itself  from  other  lexicographic  research  on  MRDs  in  that  I  am  developing 
more  complex  Relation-Revealing  Formulae,  rather  than  the  simpler  defining  formulae. 

2.4.  Relation-Revealing  Formulae  (RRFI 

RRFs  couple  a  semantic  relation  to  the  particular  linguistic  features  of  definitions 
which  reliably  indicate  the  presence  of  that  semantic  relation.  The  difference  between 
defining  formulae  and  RRFs  is  that  the  former  make  use  of  only  lexical  or,  less 
frequently,  syntactic  string  matching,  whereas  RRF  are  capable  of  capturing  and 
representing  a  broader  scale  of  complexity  in  the  regularities  of  definitions.  The  three 
levels  of  RRF  which  I  am  developing  are  presented  in  Figure  2  and  each  is  followed  by  a 
sample  RRF  template  of  that  level. 


LEXICAL  RRF:  Require  straightforward  lexical  string  matches  wherein  the  exact  word 
or  phrase  which  has  been  established  as  a  lexical  clue  to  the  relation  is 
matched  in  the  definition.  More  complex  LEXICAL  RRF  require  the 
recognition  of  non-adjacent  phrase  patterns. 

RRF:  LOCATION  =  ’where’  +  <  > 

SYNTACTIC  RRF:  Consist  of  either  sequences  of  words  of  specified  parts-of-speech  or 
combinations  of  lexical  string  matches  and  specified  parts-of-speech 
and  takes  advantage  of  the  grammar  codes  available  on  the  LDOCE  tape. 
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RRF:  RESULTANT  =  *to  make'  +  <acljective> 

SEMANTIC  RRF:  Depend  on  information  about  semantic  features  of  words  (e.g.  animacy) 
in  the  formuiae  being  available  in  the  dictionary  enity  box  codes.  It  is 
typical  for  a  single  Semantic  RRF  to  include  lexical  and  syntactic 
matching  as  well  as  semantic  matching. 

RRF:  EXPERIENCER  =  'a'/’an*  +  <animate  noun>  +  'who'  +  <stative  verb> 

Fig.  2:  Three  Levels  of  Relation-Revealing  Formulae 


3.  Methodology 

Figure  3  provides  my  long-range  research  plan  for  the  development  and  testing  of 
the  RRF  methodology.  The  results  of  this  work  will  later  be  integrated  with  a  neural  net 
approach  for  detecting  relations  which  together  will  form  an  Intelligent  Semantic 
Relation  Assignor  (Liddy  &  Paik,  1991b). 


1 .  Maximum  Coincidence  Searching  of  LDOCE  definitions 

2.  RRF  template  development 

3 .  Semantic  relation  tagging  of  one-half  of  LDOCE 

4.  Semantic  Relation  Formulae  testing  on  remaining  one-half  of  LDOCE 

5.  Evaluation  of  Relation  Revealing  Formulae  results 

6.  Development  of  an  Intelligent  Semantic  Relation  Assignor  (ISRA) 

Fig.  3:  Research  Overview 


The  main  effort  in  the  preliminary  development  of  the  methodology  consists  of  the 
labor-intensive,  rigorous,  intellectual  analysis  of  large  sets  of  definitions  for  the 
inductive  discernment  of  semantic  relations  which  are  predictably  indicated  by  the 
various  levels  of  linguistic  regularities  described  above.  This  is  the  methodology 
development  which  I  have  been  focusing  on  this  summer  and  consists  of  the  following 
steps: 
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1 .  Computationally  analyzing  one-half  of  the  definitions  in  LDOCE  using  the  Maximum 
Coincidence  Search  procedure  to  exhaustively  identify  all  re-occurring  phrases, 
including  non-adjacent  patterns. 

2.  Establishing  the  specific  semantic  nature  of  the  relations  revealed  by  these  linguistic 
regularities. 

3.  Coupling  of  the  highest  level  of  abstraction  of  these  linguistic  regularities  with 
their  appropriate  relation  in  RRF  templates. 

3.1.  Maximum  Coincidence  Search  fMCS^ 

The  Maximum  Coincidence  Search  technique,  which  was  developed  for  the  study  of 
LDOCE  by  Woojin  Paik  (Liddy  &  Paik,  1991a)  searches  for,  and  extracts,  the  longest 
possible  string  match  (whether  terms  are  adjacent  or  not)  within  a  specified  textual 
unit,  and  creates  sets  of  these  patterns  (i.e.  coincidences  of  terms)  while  tagging  the  text 
for  the  position  in  which  each  pattern  occurred. 

MCS  is  being  used  during  RRF  development  to  extract  all  possible  coincidences  of 
terms  from  text.  Referring  to  the  example  presented  in  Figure  4,  one  can  see  how  the 
MCS  technique  works.  Processing  the  LDOCE  tape  using  MCS  detects  the  reoccurring 
pattern  of  adjacent  and  non-adjacent  combinations  of  the  two  words  'act'  and  'of:  a  total 
of  750  times  in  a  variety  of  patterns.  The  first  column  presents  ten  observed  instances 
taken  directly  from  LDOCE.  The  second  column  lists  the  generalized  surface  lexical 
patterns  existing  in  these  instances,  while  the  third  column  presents  five 
lexical/syntactic  patterns  underlying  the  ten  instances.  These  five  patterns  can  then 
be  abstracted  to  the  final  RRF  developed  for  the  semantic  relation:  ACT.  The  template 
capable  of  correctly  recognizing  all  750  occurrences  of  the  ACT  Semantic  Relation,  as 
indicated  by  any  combination  of  'acf  and  'of,  is  presented  at  the  bottom  of  Figure  4.  All 
occurrences,  when  Individually  analyzed  did  indicate  this  semantic  relation  and  none 
other. 
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OBSERVED  INSTANCES 

LEXICAL 

PATTERN 

LEXICAL  /SYNTACTIC 
PATTERN 

an  act  of  admonishing 
the  act  of  accepting 

an  act  of  ...ing 
the  act  of  ...ing 

ART  +  flcfo/  +PRES.PART 

a  single  act  of  breathing 
an  official  act  of  forgiving 
the  sound  or  act  of  firing 

a _ act  of...ing 

an _ act  of  ...ing 

the _ act  of  ...ing 

ART  +_+  act  of  +  PRES.PART 

an  act  of  secretly  escaping 
the  act  0/ purposely  staying 

an  act  of _ ...ing 

the  act  of _ ...ing 

ART  +  act  of +_+  PRES.PART 

an  actor  state  of  boiling 
the  act  or  process  of  converting 

an  act  or _ of  ...ing 

the  act  or _ of  ...ing 

ART+  act  +_+  of  +  PRES.PART 

an  example  or  act  of  not  being... 

a _ or  act  of _ ...ing 

ART+_+flcf  o/+_+  PRES.PART 

RRF  for  ACT  =  ART +  [  ]+act  +  U+of +[  ]+ PRESENT  PARTICIPLE 

Fig.  4:  Sample  MCS  Patterns 

Although  this  is  a  trivial  example,  it  exemplifies  precisely  how  MCS  allows  me  to 
examine  the  machine-readable  tape  of  LDOCE  efficiently  in  order  to  detect  all  possible 

variations  of  word  patterns  and  to  uncover  from  amongst  the  surface  variety  the  basic 
phrasal  patterns,  which  then  undergo  Intellectual  analysis  for  the  inductive  discernment 
of  the  semantic  relation  indicated  by  each  basic  pattern  and  to  develop  templates  which 
can  be  used  by  an  algorithm  to  detect  a  particular  semantic  relation  no  matter  how  the 
basic  phrasal  pattern  is  varied. 


3.2.  Establishing  the  RRF  Methodology 

The  establishment  of  a  methodology  for  developing  RRF  was  begun  with  an  analysis  of 
verb  definitions  in  LDOCE.  One  reason  for  this  choice  was  to  facilitate  the  research  of 
Michael  McHale  of  Rome  Labs,  whose  development  of  a  principle  based  parser  (McHale, 


1991)  requires  knowledge  of  the  thematic  roles  (e.g.  agent,  experiencer)  of  noun 
phrases  for  the  purpose  of  eliminating  Ill-formed  syntactic  structures.  The  information 
as  to  thematic  roles  is  not  explicitly  available  in  any  of  the  coded  fields  in  LDOCE  but 
RRF  offer  one  means  of  determining  which  thematic  roles  occur  with  each  verb.  I  also 
considered  verbs  a  useful  group  to  analyze,  since  research  to-date  on  the  simpler 
defining  formulae  has  focused  on  either  noun  or  adjective  definitions,  with  little 
attention  to  verb  definitions. 

Most  definitions  provide  a  genus  term  for  the  word  being  defined  and  then  some 
differentia  that  distinguish  the  word  being  defined  from  other  classes  of  the  same  genus. 
Since  verbs  are  usually  defined  with  a  verb  as  their  'genus'  term,  a  frequency  count  was 
made  of  the  individual  verbs  from  one-half  of  the  verb  definitions  in  LDOCE,  and  showed 
that  'make'  occurred  529  times;  'cause'  occurred  402  times  and  'be'  occurred  380 
times.  Given  these  figures,  the  choice  was  made  to  begin  with  'make'  and  'cause'  for 
development  of  RRF.  The  work  has  proceeded  on  both  of  these  verbs,  with  some 
productive  overlaps  in  the  RRF,  but  only  the  development  of  RRF  based  on  'make'  will  be 
herein  reported,  as  the  RRF  for  'cause'  are  not  refined  to  my  satisfaction. 

By  using  MCS,  I  retrieved  all  2,  3,  4,  and  5  word  (contiguous  or  noncontiguous) 
word  patterns  which  contain  'make'.  Given  that  the  development  of  RRF  begins  as  an 
inductive  process,  this  is  the  first  step  to  be  done.  Results  of  the  MCS  produce  data  on 
which  the  simplest  level,  lexical  RRF  are  based,  but,  in  addition,  given  appropriate 
analysis,  the  retrieved  data  will  also  suggest  what  the  more  complex  syntactic  and 
semantic  RRF  might  be,  and  how  information  available  on  the  machine-readable  tape 
might  be  used  in  developing  this  type  of  RRF. 

As  an  example,  retrieving  the  5  word  patterns  containing  'to  make'  includes  the 
following  patterns  and  their  frequencies: 
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to  make  (something)  35  times 

to  make  (  )by  30  " 

to  make  (someone)  27  " 

Although  none  of  these  5  word  patterns  may  be  indicative  of  a  particular  semantic 
relation,  they  do  suggest  some  lexical  patterns  to  be  looked  for  in  shorter  phrases  and 
also  begin  to  suggest  some  of  the  semantic  variations  of  'to  make'  as  used  in  LDOCE 
definitions.  In  that  the  goal  of  RRF  development  is  to  devise  a  means  whereby  the 
semantic  relations  contained  in  definitions  can  be  explicated,  it  is  essential  that  the 
possible  senses  of  frequently  occurring  verbs  such  as  'make'  can  be  disambiguated.  For 
instance,  it  would  seem  intuitive  that  the  sense  of  'make'  would  be  different  when 
followed  by  'something'  rather  than  by  'someone'  and  that  semantic  representations  of 
these  definitions  should  reflect  these  differences. 

By  consulting  the  definition  of  the  word  'make'  itself  as  provided  in  LDOCE,  one  sees 
that  'make'  has  three  basic  senses,  which  can  be  briefly  paraphrased  as; 

1.  to PRODUCE 

2.  to  PERFORM 

.3.  to  CAUSE  or  CAUSE  TO  BE 

This  information  can  be  entered  as  the  hypothesis  into  the  inductive-deductive  loop 

HYPOTHESIS 


DATA 

Fig.  5:  Constant  Comparative  Methodology 
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of  the  Constant  Comparative  methodology  (Glaser  &  Straus,  1967)  as  seen  in  Figure  5. 
This  is  the  basic  research  paradigm  i  am  following  in  this  effort. 

Next,  with  these  hypothesized  senses  in  mind,  I  used  the  second  function  of  the  MCS 
program  to  look  at  the  sets  of  definitions  which  contain  a  searched  pattern.  Analyzing 
these  sets  of  definitions,  the  syntactic  and  even  the  semantic  patterns  begin  to  suggest 
themselves.  For  example,  there  are  many  definitions  which  are  of  the  basic  form* 

■to  make*  +  ADJECTIVE 

where  the  great  semantic  variety  of  the  adjective  does  not  matter  in  terms  of  the 
predictable  relation  between  the  word  being  defined  and  that  adjective.  For  example,  the 
definitions  in  Figure  6  all  reveal  the  CAUSE  TO  BE  relation  with  the  adjective  that 
follows  'to  make',  although  in  some  instances  it  might  not  follow  it  directly. 


constrict  -  to  make  [CAUSE  TO  BE]  narrower,  smaller,  or  tighter 
consummate  -  to  make  (CAUSE  TO  BE]  (a  marriage)  complete  by  having  sex 
convey  ■  to  make  [CAUSE  TO  BE]  (feelings,  ideas,  thoughts,  etc.)  known 
defuse  •  to  make  [CAUSE  TO  BE]  less  dangerous  or  harmful 
exhilarate  -  to  make  [CAUSE  TO  BE]  (someone)  cheerful  and  excited 

Fig.  6:  Definitions  containing  CAUSE  TO  BE  sense  of  'make' 


In  addition  to  identifying  that  'to  make'  reveals  the  CAUSE  TO  BE  relation  between  the 
verb  being  defined  and  the  following  adjective,  I  have  also  identified  the  adjective  or 
adjectives  as  being  in  the  RESULTING  ATTRIBUTE  relation  to  the  verb  being  defined. 
Figure  7  shows  these  relations. 


constrict  -  to  make  [CAUSE  TO  BE]  narrower  [RES.ATT],  smaller  [RES.ATT],  or  tighter 
[RES.ATT] 

consummate  -  to  make  [CAUSE  TO  BE]  (a  marriage)  complete  [RES.ATT]  by 
having  sex 

convey  -  to  make  [CAUSE  TO  BE]  (feelings,  ideas,  thoughts,  etc.)  known  [RES.ATT] 
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deafen  -  to  make  [CAUSE  TO  BE]  deaf  [RES^TT],  esp.  for  a  short  time 
defuse  -  to  make  [CAUSE  TO  BE]  less  dangerous  [RES.ATT]  or  hatmful[RES.ATT] 
exhilarate  -  to  make  [CAUSE  TO  BE]  (someone)  cheerful  [RES.ATT]  and  excited 
[RES.ATT] 

Rg.  7:  CAUSE  TO  BE  and  RESULTING  ATTRIBUTE  relations 

Further  analysis  allowed  me  to  delineate  the  semantic  nature  of  the  other  entities  in 
the  CAUSE  TO  BE  schema.  These  are  PATIENT  for  all  human  nouns  and  AFFECTED  OBJECT 
for  all  other  nouns.  In  addition,  the  MEANS  relation  is  revealed  quite  predictably  in  'to 
make'  definitions  by  a  phrase  beginning  with  'by'  at  the  end  of  the  definition.  Figure  7 
shows  the  same  definitions  with  these  additional  relations  identified.  Whether  a  noun  is 
human  or  not  can  be  determined  by  accessing  the  Box  Codes  which  are  available  on  the 
LDOCE  tape  but  not  In  the  printed  version  of  the  dictionary  and  indicate  either  the 
semantic  nature  of  the  word  being  defined  or  the  semantic  restrictions  on  the  subject  and 
object  of  the  verb  being  defined  or  the  acceptable  noun  that  can  be  qualified  by  the 
adjective  being  defined.  This  is  essential  knowledge  needed  for  developing  Semantic  Level 
RRF. 


constrict  -  to  make  (CAUSE  TO  BE]  narrower  [RES.ATT]  smaller[RES.ATT],  or  tighter 
[RES.ATT] 

consummate  -  to  make  (CAUSE  TO  BE]  (a  marriage)  [AFF.OBJ]  complete  [RES.ATT]  by 
having  sex  [MEANS] 

convey  -  to  make  [CAUSE  TO  BE]  (feelings,  ideas,  thoughts,  etc.)  [AFF.OBJ]  known 
[RES.ATT] 

deafen  -  to  make  [CAUSE  TO  BE]  deat  [RES. ATT],  esp.  for  a  short  time 
defuse  ■  to  make  [CAUSE  TO  BE]  less  dangerous  [RES.ATn  or  harmful[RES.ATT] 
exhilarate  -  to  make  [CAUSE  TO  BE]  (someone)  [PATIENT]  cheerful  [RES.ATT]  and 
excited  [RES,ATT] 

Rg.  8:  CAUSE  TO  BE,  RESULTING  ATTRIBUTE,  AFFECTED  OBJECT,  PATIENT  &  MEANS 
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Processsig  cf  vsib  deSmStms  a»T3^^  'to  nt^ceT  in  one  o(  LOOCE  protted 
two  types  of  usefiil  resi^  Fhsfy.  the  BBF^  titen^eives,  wiucii  ccoid  successfully 
cfifferentiale  among  the  venous  senses  cf  "to  mske’  sad  sSso  leve^  the  semantc  rotes  of 
other  constiluenls  ai  the  de^uSons,  and  seconrSy;  the  particutar  steps  of  the 
methodoicgy  used  were  cocSned  sahidenty  that  the  methodoScgy  can  now  be  taught  to 
reseaicA  asdstants  so  that  hiriher  devdopment  of  a  semantic  iexiicon  based  on  LOOCE  can 
be  produced  for  use  in  a  range  of  NLP  systems. 

4.1  RRF  for 'to  make' 

The  RRF  for  the  remaining  two  senses  cf  *to  make'  were  devefoped  following  similar 
procedures.  This  effort  was  aided  by  the  fact  that  the  secondary  RRFs  devefoped  as  by¬ 
products  while  developing  RRF  for  the  first  sense  of  'to  make'  were  usabfo  in  other 
situations  as  welt.  Rgc'e  9  presents  the  base  s.  ses  of  'to  make’  which  were  evide 
in  t'  e  definitions,  along  with  a  sample  defiratior  '  nlaining  each  of  these  senses. 


1 .  to  make  [  a  ]  =  (to  produce  a) 

crack  -  to  make  a  sudden  loud  sharp  sound 

2. a.  to  make  [  b  ]  =  (to  cause  to  be  b) 

disquiet  -  to  make  anxious 
b.  to  make  [  c  ]  [  b  ]  =  (to  cause  c  to  be  b) 

annoy  -  to  mcj<e  someone  a  little  angry  or  impatient,  esp.  by  repeated 
troublesome  actions  or  attacks 

3.  to  make  [c]  [d]=  (to  cause  c  to  d) 

abase  -  to  make  (esp.  oneself)  lose  self-respect 

Fig,  9  :  Four  senses  of  'to  make' 


These  four  scnaca  of  'to  make’  can  be  detected  by  syntactic  level  RRF  because  each  of  the 
four  patterns  can  be  matched  on  part  of  speech  information,  as  seen  in  Figure  10.  These 
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syiHacSc  level  RRF  reqi^e  the  RRF  ^^jGcalor  fo  consult  the  online  LOOCE  file  to 
delem^  the  pails  of  speech  of  flte  tvords  in  the  definitions. 


1 .  to  make  [  a  ]  =  (to  produce  a) 

RRF  (PRODUCE  =  lo  make*  +  noun  phrase 
2.3.  to  make  [  b  ]  =  (to  cause  to  be  b) 

RRF  (CAUSE  TO  BE)  =  to  make*  +  ac^ecGve 
b.  to  make  ( c  ]  [  b  ]  =  (to  cause  c  to  be  b) 

RRF  (CAUSE  TO  BE)  =  to  make*  -r  noun  phrase  +  adjective 
3.  to  make  [  c  ]  [  d  ]  =  (to  cause  c  to  ^ 

RRF  =  (CAUS^  =  to  make*  +  noun  phrase  +  verb 

Fig.  10:  RRF  for  detecting  senses  of  *to  make* 


In  addition,  semantic  information  that  is  available  in  the  Box  Codes  on  the  machine- 
Readable  LDOCE  can  also  be  included  m  the  RRF  if  a  fuller  conceptual  representation  of 
the  dennitf!$ns  is  desired.  For  example,  in  Rgure  11.  if  the  noun  phrase  that  fills  the  A 


1.  RRF  (PRODUCE]  =  to  make*  +  noun  phrase  (A) 

2. a.  RRF  (CAUSE  TO  BE]  =  to  make*  +  noun  phrase  (C)  +  adjective  (B) 

-  or  - 

2. b.  RRF  [CAUSE  TO  BE]  =  to  make*  +  adjective  (B) 

3.  RRF  [CAUSE];  =  to  make’  +  noun  phrase  (C)  +  verb 

where; 

A  =  RESULT  CONCRETE  if  Box  Code  1  =  S,  J,  N 
or 

A  =  RESULT  ABSTRACT  if  Box  Code  1  =  T 
B  =  RESULTING  ATTRIBUTE  if  Box  Code  1  =  T 
0  =  PATIENT  if  Box  Code  1  =  Q,  P,  A,  H,  B,  D,  M,  F 
or 

C  =  AFFECTED  OBJECT  if  Box  Code  1  =  S,  J,  N,  I,  L,  G 
Fig.  11:  Fully  Realized  RRFs  for  Verbs  Defined  by  "to  make" 


slot  in  RRF  1,  has  either  Box  Code  S,  J,  or  N  on  the  LDOCE  tape,  then  that  noun  phrase  is 
the  CONCRETE  RESULT  of  the  verb  being  defined.  Otherwise,  if  the  noun  phrase  has  Box 
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Code  T,  that  noun  phrase  is  the  ABSTRACT  RESULT  of  the  verb  being  defined.  This  is 
exactly  the  level  of  refined  semantic  representation  that  is  needed  by  complex  NLP 
systems  for  which  this  methodology  wll  be  used  to  develop  lexicons. 

4.2  RRF  Methodology  Devetooment 

The  goal  of  this  summer  research  effort  was  to  develop  a  methodology  for  extracting 
semantic  relations  from  dictionary  definitions.  In  outline  form,  the  methodology  which  I 
have  described  in  procedural  detail  above,  is  summarized  in  Figure  12. 


1 .  Decide  part  of  speech  to  begin  with. 

2.  Run  frequency  distribution  of  vocabulary  used  in  definitions  of  this  part  of  speech. 

3.  Choose  individual  words  to  begin  analysis  with. 

4 .  Run  Maximum  Coincidence  Search  on  these  words. 

5 .  Use  Constant  Comparative  Methodology  to  iterate  between  data  and  hypothesized  RRF. 

6.  Search  for  lexical  patterns. 

7.  Retrieve  definitions  containing  these  patterns. 

8.  Determine  semantic  relation  being  conveyed. 

9.  Study  fuller  sets  of  definitions  to  determine  whether  RRF  can  be  generalized  to  a 
syntactic  or  semantic  level  RRF. 

1 0  .Test  RRF  on  other  half  of  LDOCE. 

1 1  .Revise  RRF  and  retest. 

Fig.12:  Steps  in  RRF  Methodology  Development 


5.  Conclusions 

The  successful  production  of  both  syntactic  and  semantic  level  RRF  which  can 
disambiguate  the  individual  senses  of  'to  make’  in  dictionary  definitions  and  produce 
semantically  lagged  representations  of  verbs  which  are  defined  by  'to  make'  suggests  that 
this  methodology  will  provide  the  necessary  tool  for  developing  automatic  procedures  for 
extracting  semantic  relations  from  text. 
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6.  Future  Work 


The  current  effort  has  resulted  In  the  successful  development  of  RRF  for  'to  make' 
and  the  specification  of  a  methodology  for  developing  RRF  which  will  produce 
representations  of  text  in  which  the  implicit  semantics  are  made  explicit.  There  are 
three  possible  routes  to  be  followed  in  the  continuation  of  this  work: 

1 .  Empirical  testing  of  'to  make’  RRF  on  the  second  half  of  LDOCE. 

2.  Generalization  of  'to  make’  RRFs  to  similar  verbs. 

3.  Training  Research  Assistants  to  continue  RRF  development  under  my  supervision. 

It  is  hoped  that  all  of  these  avenues  can  be  pursued  Immediately  through  funding  by 

the  Research  Initiation  Program,  it  is  essential  that  the  research  be  pursued,  for  as 
Velardi  et  al  (1991,  p.  157)  hav'e  stated:  "automatic  or  semi-automatic  acquisition  of 
the  lexicon  is  a  critical  factor  in  determining  how  widespread  the  use  of  natural  language 
processors  v/ill  be  In  the  next  few  years". 
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APPLICATION-BASED  UTILITY  EVALUATION:  A 
DISCUSSION  LEADING  TO  ASSESSING  THE  ROLE 
OF  END-USERS  IN  THE  EXPLOITATION  OF 
VIRTUAL  REALITY  TECHNOLOGY 


Michaels.  NHan,Ph.D. 

INTRODUCTION 

Systems,  whether  computerized  or  not,  with  or  without  artificial  intelligence, 
incorporating  advanced  display  technology  or  not,  etc.can  be  seen  as  nothing  more  than 
a  set  of  procedures  and  technical  capabilities  linked  together  to  solve  a  particular 
problem  or  set  of  related  problems.  This  is  as  true  of  a  bureaucratic  standard  operating 
procedure  as  it  is  of  a  knov^ledge  based  distributed  group  decision  support  system. 

Since  the  advent  of  computers  (and  especially  in  the  last  ten  years),  the  kinds  of 
problem  solutions  that  we  have  been  implementing  in  an  automated  fashion  are 
becoming  increasingly  sophisticated  as  are  the  kinds  of  display  technologies  (e.g., 
multimedia,  virtual  reality)  and  peripherals,  network  architectures,  etc.  that  the  system 
designers  employ  to  deliver  solutions  in  a  systematic  manner. 

As  application  and  information  systems  become  more  advanced,  they  ane  also 
becoming  more  complicated.  Some  of  the  concerns  of  this  advancement  and  its  associated 
complexity  are  obvious,  like  increases  in  training  overhead  required  for  users  to  learn  and 
understand  the  system.  These  concerns  then  become  desired  goals  for  system  design  but 
solutions  are  by  no  means  obvious  in  the  current  state-of-the-art  of  system  design.  Some 
implications  are  not  as  obvious,  like  evaluating  a  system  for  its  use  of  appropriate 
technology  in  delivering  solutions.  This  is  no  longer  as  straightforward  as  it  once  was.^ 
Assessment  of  isolated  technology  (i.e.,  not  incorporated  into  a  specific  system)  tends  to  be 
in  terms  of  its  capability,  e.g.,  processing  speed,  resolution,  transmission  speed,  etc.  The 
evaluation  metrics  that  are  currently  employed  for  system  evaluation  are  oriented  towards 
user  performance  (and  are  therefore  indirect  measures  at  best  anyway)  and  have  been 
borrowed  from  education  and  psychology  without  significant  change.  If  one  assumes  that 
these  indirect  measures  (e.g.,  retention,  accuracy,  time  on  task,  demographics,  personality 


variables,  etc.)  have  been  appropriate  for  simple  learning  theory  and  physiological 
perception,  it  does  not  necessarily  follow  that  they  are  also  appropriate  for  the  increased 
demand  on  end  users  or  the  increased  sophistication  of  the  problems  so  addressed.  In 
particular,  these  inherited  metrics  cannot  directly  assess  the  complex  cognitive  behaviors 
associated  with  the  specific  problem  solving  domain  as  implemented  in  any  particular 
system  (which  includes  the  particular  technical  enhancements  employed).  The  increased 
sophistication  and  complexity  of  current  and  future  systems  calls  for  a  new  look  at 
evaluation. 

The  purpose  of  this  paper  is  to  describe  an  example  of  an  assessment  approach  that 
directly  addresses  the  evaluation  issue  of  the  utility  of  system  features  (both  hardware  and 
software)  that  are  employed  (or  might  be  employed)  in  terms  of  the  specific  problem  the 
system  is  designed  to  solve.  Such  application-based  utility  evaluation  procedures  hav^  been 
called  for  (e.g.,  Dervin  &  Nilan  1986)  and  have  begun  to  be  developed  in  information 
system  contexts  (e.g.,  Nilan  1992;  Nilan  &  Hert  1991),  but  this  paper  represents  the  first 
attempt  to  set  forth  this  logic  in  a  more  generalized  manner,  especially  in  a  manner  that  also 
allows  for  concurrent  assessment  of  appropriate  technologies  (i.e.,  display  devices, 
pointing  devices,  tactical  feedback  mechanisms,  etc.)  in  the  vinual  reality  (VR)  context.  It 
is  important  to  note  at  the  outset  tha.  the  purpose  here  Is  to  augment  current  assessment/ 
evaluation  procedures  rather  than  to  displace  them.  In  otner  words,  the  approach  presented 
here  is  ‘,een  as  an  addition  that,  when  u.sed  in  conjunction  with  existing  techniques,  will 
facilitate  decision  making  for  appropriate  technology  use  in  system  design  and  system 
evaluation. 

SYSTEMS,  SYSTEM  DESIGN,  AND  SYSTEM  EVALUATION 

Figure  1  shows  a  generic  description  of  the  notion  of  a  system. 

Central  to  this  notion  of  a  system  is  the  particular  problem  to  be  solved.  This  is  in 
keeping  with  so-called  “information  resource  management”  (IRM)  strategies  currently 
being  implemented  in  many  organizations.  Take  a  manufacturing  process  as  an  example  of 
a  problem  for  which  we  want  to  design  a  system.  The  first  step  is  to  define  the  process  in 
terms  of  its  constituent  activities  in  a  time  order  that  reflects  the  current  practice.  So,  we 
begin  with  specifying  a  product  v/hich  is  to  be  the  goal  of  the  manufacturing  process.  Let’s 
call  it  a  widget.  After  we  have  designed  this  widget,  we  must  then  purchase  the  raw 
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Figure  1:  System  =  A  set  of  procedures  designed  to  solve  a 
particular  problem.  Applies  to  bureaucratic  procedures  (e.g.,  standard 
operating  procedures,  forms),  the  organization  of  information  (e.g.. 
Library  of  Congress  classification  system),  as  well  as  computerized 
applications  including  display  technology,  interfaces,  functional 
capabilities  of  the  application  software  and  any  information  associated 

materials  necessary  for  producing  it,  warehouse  them  until  they  are  needed  and  transport 
them  to  the  manufacturing  location.  At  the  manufacturing  location,  we  have  a  sub-system 
that  actually  produces  the  v.?idget,  which  is  then  transported  to  another  warehouse  to  await 
its  sale  to  a  customer.  The  sale  of  widgets  is  carried  out  by  a  marketing  sub-system  that 
handles  such  things  as  advertising,  shipping  and  feedback  from  the  widget  purchasers, 
which  ultimately  effects  the  design  of  the  widget.  Then,  the  cycle  starts  all  over  again. 
Figure  2  graphically  illustrates  this  cycle. 

Now  that  we  have  documerted  the  process  of  widget  manufacturing,  it  is  possible 
for  an  information  system  designer  to  examine  and  evaluate  the  entire  cycle,  including 
various  technologies  in  terms  of  their  ability  to  provide  more  efficient  manufacturing.  So 
the  designer  might  try  to  design  an  information  system  that  allows  us  to  employ  “just-in- 
time”  technology.  This  technology  allows  us  to  functionally  link  sales  demands  for  widgets 
directly  to  our  purchase  of  raw  materials  and  to  the  numbers  of  employees  we  employ  in 
the  manufacturing  sub-system  process.  What  this  means  for  our  cycle  is  that  we  only  order 
enough  raw  materials  for  what  we  are  going  to  manufacture  at  any  particular  time  which  is 
in  turn  linked  to  the  demand  for  widgets.  We  have  been  able  to  cut  out  of  the  cycle  the 
warehousing  and  storage  steps  as  well  as  having  more  efficient  use  of  labor  during  the 
actual  manufacture.  The  savings  to  our  organization  from  a  (relatively)  simple  information 
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Figure  2:  Hypothetical  widget  manufacturing  cycle. 

system  will  probably  be  substantial  without  any  loss  of  quality  of  the  product  or  its 
availability  to  consumers  on  demand. 

This  is  an  example  of  a  simple  system  that  manages  a  physical  entity,  i.e.,  a  widget. 
It  is  clear  in  this  example  that  the  evaluation  of  the  just-in-time  technology  is  intimately  tied 
to  the  problem  at  hand,  i.e.,  manufacturing  widgets.  The  evaluation  criteria  employed 
pertain  to  efficiency  (i.e.,  time  and  money)  associated  with  the  manufacturing  cycle 
technology  itself.  Let’s  take  a  more  complicated  example,  one  that  deals  with  a  cognitive 
process  to  illustrate  the  proposed  application-based  utility  approach  to  evaluation  of 
technology,  a  decision  support  system  that  helps  consumers  decide  what  kind  of  new  car  to 
purchase. 

This  example  deals  with  cognitive  activities  that  do  not  necessarily  have  any 
inherent  time  dependency  associated  with  individual  behaviors,  nor  is  there  an  established 
“correct”  way  to  approach  the  problem.  As  in  the  widget  example,  we  first  try  to  define  the 
task  at  hand.  Standard  knowledge  acquisition  procedures  for  knowledge  based  or  expert 


systems  wotild  call  for  interaction  with  a  new  car  expert  (see  Agarwal  1988)  to  establish 
the  criteria  associated  with  purchasing  a  new  car  (i.e.,  things  to  look  for  in  a  new  car)  and 
the  rules  for  employing  these  criteria  (i.e.,  if  you  have  children,  then  you  will  need  a  4-door 
or  a  wagon  or  a  van,  etc.).  Typical  methods  involve  asking  the  expert  to  specify  what  should 
be  looked  for  in  the  purchase  of  a  new  car.  The  answers  in  this  example  are  exactly  what 
we  might  expect.  The  expert  tells  us  we  should  look  for  low  price,  safety,  low  maintenance 
costs,  good  resale  value,  etc.  In  fact,  the  resulting  range  of  requirements  is  probably  the 
same,  as  we  would  get  if  we  asked  a  few  non-experts  on  the  street.  These  requirements 
would  then  be  built  into  a  computer  system  that  presents  the  new  car  buyer  with  a  series  of 
screens  that  present  individual  questions  and  “if...then...”  decision  rules  to  assist  in  the 
decision  making.  However,  there  are  some  problems  that  emerge  with  the  resulting  system 
when  we  evaluate  it 

The  knowledge  base  might  be  evaluated  by  comparing  several  experts’ 
determinations  of  the  sets  of  criteria  and  rules  although  this  is  seldom  done  in  practice  (see 
Kannan  1991  as  an  example  of  the  use  of  a  single  expert  for  both  the  knowledge  base  as 
well  as  the  subsequent  evaluation).  The  experts  might  agree  to  a  degree  such  that  we  are 
convinced  that  from  the  experts’  perspective,  the  system  has  sufficient  coverage.  However, 
when  we  look  at  the  use  of  the  system  by  end  users,  several  surprises  occur.  First,  it  takes 
the  users  quite  a  bit  of  time  to  learn  how  the  system  works.  Several  walk  away  from  the 
system  because  they  don’t  want  to  be  bothered  (after  all,  they  are  only  buying  a  car).  This 
phenomenon  is,  of  course,  not  captured  by  our  formal  evaluation  procedures  except  perhaps 
where  we  look  at  demographic  and/or  personality  measures  and  note  that  those  who  walked 
away  from  the  system  did  not  have  a  “sufficient”  technical  background.  We  interpret  this 
as  meaning  that  this  system  is  only  for  technical  people  or  for  those  who  will  “pay”  the 
training  overhead  to  use  the  system.  Other  users  who  did  take  the  time  to  learn  how  the 
system  worked  walked  away  before  they  had  finished  the  entire  decision  making  scenario. 
They  told  us  it  took  them  too  long  to  use  it. 

Of  those  who  took  the  time  to  learn  how  to  use  the  system  and  finished  the 
decision  making  scenario,  we  can  compare  their  purchase  with  what  the  system  told 
them  they  should  do  and  we  find  discrepancies.  These  discrepancies  are  apparent 
when  we  use  an  “accuracy-”  type  evaluation  criterion  where  the  expert  system’s 
determination  is  seen  as  the  “correct”  response  and  the  users’  deviations  from  this 


9-5 


response  are  their  scores  on  the  measure.  We  may  also  try  to  find  out  from  users  what 
criteria  actually  were  behind  their  decisions,  although  this  is  almost  never  done  except  in 
the  context  of  decision  support  system  “satisfaction”  measures  (see  Dervin  &  Nilan  1986 
for  a  discussion  of  this  category  of  system  evaluation  criteria).  >\fith  these  traditional 
satisfaction  measures  we  find  out  that  users  were  not  satisfied  with  the  advice  that  the 
experts  gave,  discounted  the  system’s  advice  and  therefore,  deviated  from  the  “correct” 
decision.  We  may  even  ask  the  users  what  they  were  dissatisfied  with  (another  uncommon 
tactic)  and  find  that  there  were  several  criteiia  that  they  considered  important  that  the 
system  did  not  address  that  are  “intangible.”  We  (the  clients  for  the  system)  might  therefore 
conclude  that  the  system  works  well  (and  probably  won’t  even  question  its  implementation 
on  a  computer),  but  only  for  those  people  who  have  a  technical  background,  who  take  the 
time  to  learn  how  to  use  the  system,  and  then  follow  the  system’s  advice.  We  do  not  come 
away  from  this  evaluation  experience  with  any  insight  into  the  technology  used,  with  the 
possible  exception  of  cost,  and  we  have  virtually  i-.o  data  on  how  to  change  the  system  so 
that  it  is  more  useful  to  users.  We  might  try  somt  intuitive  solutions,  like  putting  several 
questions/rules  on  a  single  screen  to  cut  down  or  the  time  it  takes  for  a  user  to  run  through 
the  scenario.  We  might  even  adopt  a  rapid  protot .  ping  technique  to  involve  users  in  our  re¬ 
design,  but  basically,  we  are  stuck  with  the  parti  ’ular  implementation  that  was  originally 
developed.  Although  if  we  go  back  after  a  year  ct  .so  and  compare  the  users’  satisfaction 
with  their  new  car  purchases,  we  will  find  little  oiffercnce  in  the  variance  between  those 
who  followed  the  system’s  advice  and  those  who  riid  not.  So  ultimately,  what  good  is  the 
system? 

Let  us  take  the  same  problem  and  illustrate  an  application  based  utility  approach  to 
the  design  and  evaluation  of  the  decision  support  system.  There  wiU  be  several  differences 
that  will  be  pointed  out  after  the  description  is  presented. 

Given  that  the  decision  of  what  to  look  for  in  a  new  car  is  not  a  process  that  involves 
any  physical  phenomena  and  given  that  there  is  no  inherently  “correct”  way  to  go  about 
making  this  kind  of  decision,  the  proposed  approach  would  use  a  different  technique  for 
determining  what  the  nature  of  the  task  is.  First,  recent  car  buyers  would  be  asked  to 
describe  their  experiences,  from  the  time  they  first  started  to  think  about  a  new  car  until  they 
had  actually  purchased  it,  as  a  series  of  steps.  Then,  for  each  step  in  the  process,  the  buyers 
would  be  asked  (for  example)  what  questions  they  had,  whether  they  got  answers  to  their 
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questions,  what  source  had  the  answer,  etc.  We  would  end  up  with  a  matrix-like 
description  of  the  activities  (or  steps)  in  their  car  purchasing  behavior  in 
conjunction  with  some  of  the  cognitive  information  concerns  associated  with  each 
step.  Figure  3  illustrates  this  activity-information  matrix  for  one  such  buyer. 


Figure  3:  An  abstract  activity  by  information  matrix 

with  questions  that  the  user  had  specified  at  specific  steps  as 
the  operationalization  of  the  information  component  of  the 
matrix.  This  is  essentially  a  graphic,  time  sensitive 
representation  of  a  problem  solving  experience. 


This  process  would  be  repeated  for  a  cross-section  of  recent  buyers,  ranging  from 
young  to  old,  from  expert  to  novice,  rich  to  poor,  etc.  (the  actual  number  of  people  depends 
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upon  the  nature  of  the  problem  being  aridressed  and  the  variance  in  perceptions  across  the 
end  user  population  although  it  is  alway*  far  fewer  that  individual  differences  approaches 
to  system  design  would  suggest^).  The  next  step  in  the  process  is  to  design  a  model  of  the 
new  car  buying  problem  solving  process  by  aggregating  and  summarizing  across  individual 
car  buyer  experiences.  The  model  developing  process  involves  simply  looking  for 
similarities  in  the  steps  in  individual  accounts  in  two  different  ways:  first,  in  the  description 
of  the  activities  (i.e.,  the  actual  behavior)  and  second,  in  tl»e  ordinal  time  relationship 
between  these  similar  activities.  Figure  4  illustrates  this  logic  and  procedure.  In  the  case  of 
car  buying,  for  example,  we  might  find  that  very  early  in  the  process,  buyers  would  go  out 
and  look  at  new  cars  to  see  what  was  out  there  and  begin  to  classify  cars 
according  to  price.  For  system  design  purposes,  there  are  two  striking  aspects  to  this 
finding:  first,  virtually  all  of  our  new  car  buyers  did  this  (albeit  some  of  them  only  in 
newspapers  and  magazines)  and  second,  most  of  them  did  this  early  in  the  problem  solving 
process.  We  will  find  similar  patterns  later  on  in  the  process  (e.g.,  taking  a  car  for  a  test 
drive).  The  model  of  the  task  of  buying  a  new  car  that  we  will  be  able  to  create  by  this 
process  will  represent  a  user  based  view  of  the  task  (as  opposed  to  the  arbitrary  listing  of 
criteria  and  rules  in  the  expert  based  view)  that  is  a  dynamic  representation  of  the  actual 
problem  solving  process  (analogous  to  the  manufacturing  cycle  used  in  the  first  example). 
Note  also  that  the  information  needed  for  an  individual  to  move  through  this  process  is 
contained  in  the  information  side  of  the  matrix  and  can  also  be  incorporated  into  the  model 
(see  Nilan  1992  for  a  more  detailed  description  of  this  analytic  process). 

The  resulting  model  provides  us  with  a  model  of  the  task  (i.e.,  a  knowledge  base) 
for  which  we  are  designing  a  system.  It  incorporates  a  wide  variety  of  criteria  associated 
with  the  process  that  are  tied  to  specific  points  in  the  dynamic  problem  solving  process  (as 
opposed  to  simple  lists  in  the  first  example).  We  also  note  that  the  range  of  criteria  in  this 
version  is  much  broader  than  the  expert-derived  list,  for  example,  the  color  of  the  car  is 
important  because  you  don’t  want  to  buy  a  dark  car  if  you  live  in  a  hot  region  since  it  retains 
excessive  heat  in  the  summer.  Now,  as  in  the  widget  example,  we  can  begin  to  make 
decisions  about  what  kind  of  system  we  should  design. 

We  note  that  the  process  occurs  over  a  long  period  of  time  and  at  several  different 
locations  so  we  need  to  be  sure  that  users  can  access  our  DSS  at  various  times  and  at  various 
locations.  If  computer  networking  was  available  and  most  of  our  users  had  personal 
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computers  with  modems,  it  would  probably  be  possible  to  make  this  system  available  over 
a  network.  Neither  of  those  assumptions  is  warranted  however,  but  we  may  be  able  to  make 
the  system  available  in  places  where  people  are  thinking  about  this  problem  (i.e.,  in  car 


‘Test  drive  a  car.” 


□ 

r~i 

□ 


□  □□□ 


Figure  4:  A  hypothetical  example  of  several  user  descriptions  of 

buying  new  cars  showing  the  overlap  of  activities  (^iook  for  what  is 
out  there”  and  “test  drive  a  car”)  as  well  as  the  ordinal,  temporal 
relationship  between  the  two  overlaps  (“looking”  early  in  the  process 
and  “driving”  late  in  the  process). 


showrooms)  if  there  are  computers  in  tliose  locations.  In  this  hypothetical  case  a  pamphlet 
is  probably  a  more  appropriate  technology.  Even  though  it  is  an  inefficient  format  for 
updating,  it  is  a  format  that  will  allow  the  user  to  access  it  over  time  and  at  different 


locancms.  Tbc  fonna  is  also  QS^  to  users  because  &ey  can  keep  trade  af  didr  pirgicss 
and  dedsaons  willing  on  the  pampbkL  Tb^  need  no  training  to  use  tbe  system  because 

they  already  know  bow  to  lead,  understand,  etc.  de  sn  ^dlls  cecessaiy  for 

efifeedve  use  of  our  system  now. 

Our  inodd  also  provides  us  with  a  means  crfoiganizing  the  criteria  and  rules 
associated  widi  the  user  based  desa^ikn  die  fade,  Le.,  in  a  tenqwial  sequence  winch 
matdies  the  peicqitioas  bow  new  car  boyas  approach  the  problem.  Criteria  that  pl^  an 
important  role  eaily  in  Cie  process  are  located  “eariiei^  in  the  panqifalef  so  the  user  accesses 
the  DSS  according  to  where  dbe  is  in  the  problem  solving  process  (as  (^iposed  to  die  earlier 
arbitrary  listing).  (NOTE:  If  we  had  ded&d  to  use  a  conqmteiized  format  fm  our  DSS, 
these  features  could  also  be  accommodated  such  that  die  user  would  not  have  to  leam  very 
much  to  access  the  system’s  knowledge  b^xause  its  cxganizatum  would  be  exaedy  the 
same). 

The  evaluation  of  our  ^stem  (the  “appnqiriate”  technology  evaluation  was  dealt 
with  above),  begins  with  an  examination  of  the  coverage  of  the  knowledge  base,  which  is 
addressed  in  the  knowledge  acquisition  and  analysis  phases.  By  sampling  across  variables 
that  would  seem  to  account  for  variance  in  users’  peicqirions  of  the  problem  solving 
process  (e.g.,  new  buyers  versus  experienced  buyers),  we  begin  to  find  that  talking  to  more 
people  does  cot  give  us  any  new  insight  into  the  task  definition  process.  Once  we  have 
begun  to  experience  redundancy  in  aedvides  and  information  across  pertinent  sampling 
variables,  v/e  can  stop.  While  :t  is  probably  tnie  that  om*  ran^e  is  not  100%  complete,  it  was 
evaluated  in  much  the  same  way  that  the  expert-derived  range  (i.e.,  redundancy  among 
experts  versus  redundancy  among  end  users).  Further,  our  range  is  broade*'  than  the  expert- 
derived  range  and  includes  criteria  d.at  users  see  as  importai.t  (e.g.,  the  color  of  the  car ’s 
interior)  that  would  not  have  occurred  to  experts.  The  significant  differences  here  arc  that 
first,  the  resultant  range  of  criteria  is  oriented  to  how  people  acmally  solve  the  problem  of 
buying  new  cars  as  opposed  to  how  experts  intuit  that  the  problem  should  be  solved,  and 
second,  that  the  range  is  organized  and  presented  to  the  user  in  a  process  oriented  manner 
(i.e.,  a  problem  solving  time  line). 

Other  evaluation  comparisons  might  include  asking  another  set  of  recent  car  buyers 
or  current  new  car  buyers  to  evaluate  the  usefuln*’  .s  of  the  various  criteria  and  to  se^  if  they 
have  concerns  or  questions  which  our  system  does  not  include  (as  opposed  to  a  traditional 
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saiisfactioa  measoie).  TMs  would  tell  OS  not  <Hily  whsie  our  sys!^  is  inoHiqjlete  or  does 
cot  masch  wfaax  users  need,  but  it  will  also  the  system  designers  ^>ecific  data  cm  what 

has  to  be  changed  and  bow  it  needs  to  be  dianged.  If  we  had  a  computoized  fonnat  fOT  our 
^stem,  wMcfa  would  be  ofganized  in  a  manner  vexy  similar  to  our  printed  ftHmat,  we  could 
also  test  various  netwoilc,  conqniter  plaifonn  and/or  conqtuter  locations  variables,  all 
wiriiin  die  context  of  the  usefulness  vis-a-^  the  actual  task  rather  than  testing  user 
perfonnance  against  an  ariritraiy  expert-doived  standard.  Fiirdicr,  the  evaluatkm  mteria 
are  derived  from  the  task  itself  and  its  associated  oignitive  bdiavims  (e.g.,  reduction  of 
uncertain^')  rather  than  inherited  fiom  behavins  that  are  assumed  to  be  relevant  (e.g., 
abili^  to  recall  content,  time  on  task,  etc.). 

A  COMPARISON  OF  EVALUATION  STRATEGIES  AND  CRITERIA 

Criteria  used  for  evaluation  of  systems  can  be  divided  into  two  major  categories: 
efSdency  and  effectiveness.  Efficient^  criteria  axe  the  best  known  in  the  area  of  formal 
evaluation  and  are  accordingly  readily  recognized.  Efficiency  criteria  deal  with  saving  time 
and  money,  and  with  conespondence  with  an  external  standarcL  Examples  would  include: 


•  coverage  -  used  here  to  compare  the  ranges  of  criteria  between  the  expert- 
driven  and  the  application-based  approaches  to  our  hypothetical  DSS  for  car 
buyers.  The  contrast  between  the  two  examples  illustrates  differences  in  the 
knowledge  acquisition  procedures,  in  the  validity  of  the  resultant  ranges,  and  in 
the  abiliy  of  the  ranges  to  meet  the  needs  of  the  end  users; 

•  user  demography/personaliiy  variables  -  used  here  to  explain  why  certain 
types  of  users  do  col  find  the  system  useful  in  the  first  DSS  approach  and  to 
assure  coverage  of  liie  problem  solving  behaviors  and  criteria  through  sampling 
procedures  in  the  second; 

•  accuracy  -  used  in  the  first  DSS  example  to  compare  what  experts  think  users 
should  do  as  opposed  to  what  the  users  actually  do  whereas  in  the  second 
example,  the  assumption  that  there  was  no  “correct”  way  to  solve  the  problem 
(which  is  the  case  vrith  the  majority  of  the  emerging  automated  applications) 
transformed  this  criterion  into  a  “goodness  of  fit”  between  the  aggregated  user 
perceptions  and  the  system  model;  and 

•  time  considerations  -  usually  operationalized  as  the  time  required  for  a  user  to 
accomplish  a  particular  task  associated  with  the  intent  of  the  system,  they  are 
reflected  here  as  the  time  users  spend  accessing  the  DSS  in  the  first  example.  In 
the  second  example,  time  is  seen  as  a  major  task  model  organization  logic, 
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taking  advantage  of  how  people  experience  the  problem,  how  th^  tend  to  talk 
about  it,  etc. 

Efif&mveness  criteria  reflect  a  relatively  recent  concern  in  system  and  technolo^ 
evaloarion,  primarily  because  effectiveness  is  oriented  towards  the  system  user  while 
efiSciracy  criteria  have  been  orioited  towards  the  ^siem  designos  and/or  the  organization 
wifliin  which  the  uso*  works.  Given  the  surge  in  interer  *  in  “user  friendliness”  in  the  last 
few  years,  it  is  natural  that  effectiveness  criteria  are!  ren  as  more  usefirL  They  tend 

to  address  the  quality  of  the  system  in  matching  to  flu  .radons  and  needs  of  the  user. 

If  carefully  conceptualized  arul  r^reraiionalized,  th^  can  also  give  the  system  designer 
significant  inright  to  ^stem  structuring.  Examples  used  here  include: 

•  satisfactirm  -  used  in  the  first  DSS  example  as  either  a  binary  measitre  Q.e.,  the 
user  evaluates  the  system  and/or  system  components  as  either  satisfactory  or  not 
satisfactory)  or,  if  pursued  qualitatively,  can  indicate  problems  with  the 
knowledge  base,  system  implementation,  etc.  but  no  specific  system 
improvement  data.  The  analog  to  satisfaction  for  the  second  example  woirld  be 
to  have  car  buyers  use  and  comment  on  the  ^stem  in  terms  of  its  completeness 
and  utiliQr  which  provide  system  designers  with  the  “locations”  of  problems 
(i.e.,  where  in  the  task  model  they  arc)  as  well  as  some  insight  into  how  the 
system  has  to  change  to  be  more  useful; 

•  system/user  mismatch  •  related  to  the  use  of  the  nodon  of  accuracy  in  the 
effectiveness  criteria  above,  this  kind  of  criterion  looks  for  differences  in  the 
way  that  systems  are  implemented  (e.g.,  appropriate  technology  evaluations), 
the  content  of  the  system  (e.g.,  differences  in  user  perceived  car  buying  criteria 
and  those  extant  in  the  system),  and  the  structuring  of  the  system  features  (e.g., 
time  ordered  versus  arbitrary  organization); 

•  usefulness/usability  -  this  criterion  addresses  concerns  like  the  amount  of 
training  required  (i.e.,  the  training  overhead)  and  ease  of  use  from  the 
perspective  of  the  users,  the  abflity  of  the  user  to  “navigate”  through  the  system 
(see  Newby  1991  for  a  description  of  this  specific  criterion)  in  terms  of  the 
user’s  perception  of  where  s/he  is  relative  to  flie  actual  problem  being  solved 
and  whether  or  not  the  user  has  a  sense  of  where  to  go  next.  Although  this 
criterion  is  being  used  more  often  in  more  conventional  system  evaluation,  it 
focuses  more  on  expert-  or  system-derived  concerns  rather  than  user 
perceptions;  and 

•  information  needs  -  associated  with  the  use  of  any  system  is  the  information 
needed  to  use  the  system  as  well  as  the  information  (particularly  in  a  DSS) 
needed  for  the  user  to  maximize  his/her  benefits  in  solving  the  problem.  In  most 
system  designs,  the  information  functions  are  secondary  and  not  conceptually 
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linked  to  the  system  content  and  stiuctuie. 

This  contrast  of  efficiency  and  effectiveness  criteria  is  not  meant  to  be  an  e)diaustive 
con^arison  of  these  two  categories,  rather,  it  is  intended  to  give  the  reader  a  sense  that  the 
^plicadon-based  udiity  approach  to  system  desgn  and  evaluation  promises  to  add  some 
valuable  insist  into  die  des^  of  the  increasingly  complex  applications  now  being 
developed-  This  in  turn  will  provide  client  organizations  (e.g.,  the  government,  private 
sector  businesses,  etc.)  the  basis  upon  which  to  make  deci^ons  about  specific  technological 
innovations  like  display  tecbnolojses  as  well  as  system  design  features  in  terms  of  (heir 
specific  problem  solvit^  needs.  It  is  this  latter  capabili^r  that  promises  to  be  the  most 
valuable  in  improving  the  development  of  efficient  and  effective  systems. 

The  discussion  above  was  intended  to  lay  the  groundwork  for  an  examination  of  the 
applitability  of  this  approach  for  the  emer^g  technology  of  virtual  reality  (VR).  What 
follows  is  a  description  of  some  of  the  features  of  the  technology,  followed  by  a  description 
of  its  potendal  domains  of  use. 

CUTTING  EDGE  SYSTEM  TECHNOLOGY;  VIRTUAL  REALITY 

Virtual  reality  is  a  term  given  to  the  extension  of  the  quality  i '  immersion  (i.e., 
providing  the  human  senses  with  the  illusion  of  existing  in  an  electronic  place)  of  the  older 
but  still  recent  technology  called  multimedia.  This  means  that  one  or  more  of  the  human 
senses  (seeing,  hearing,  touching,  smelling,  tasting)  is  presented  with  a  stimnius  that 
appears  to  be  “real”  and  present  (i.e.,  in  three/four  dimensions)  rather  than  electronically 
generated  through  the  presentation  of  visual,  aural,  tactile,  etc.  clues.  Although  the  three 
levels  or  types  of  VR  presented  here  are  oversimplified  and  focus  on  distinctions  between 
display  and  pointing  devices,  tliey  do  help  to  demonstrate  a  need  for  a  different  approach 
to  evaluation. 

There  are  (roughly)  three  levclsor  types  of  teslinobgy  used  in  current  applications 
of  VR.  The  first  level,  which  I  call  “low-tech”  uses  readily  available,  existing  computer  and 
electronic  media  technology  (e.g.,  large  2-D  screens,  stereo  or  up  to  seven  channel  sound, 
video  cameras,  microphones,  etc.)  to  provide  an  approximation  of  immersion.  Large  screen 
T\'’s  with  surround  sound  and  cir.emax  theaters  are  examples  of  this  level  of  technology. 
Particularly  in  the  sound  area,  there  are  significant  developments  in  the  technology  that 
have  existing  and  expanding  consumer  markets  (e.g.,  home  theaters  with  Dolby  Pro  Logic 


sound  generated  by  high  resolution  laser  disc  players).  By  including  computers  with  this 
kind  of  technology,  VR  designers  have  been  able  to  suggest  a  wide  variety  of  applications 
for  this  level  of  technology  to  include  exercise  applications  where  the  user  jogs  or  rides  a 
bicycle  through  exotic  landscapes  with  their  attendant  sounds.  Existing  non-media 
equipment  (e.g.,  the  stationary  exercise  bicycle  or  a  treadmill)  is  used  to  provide  tactile 
feedback  in  some  applications.  Some  of  the  earlier  (and  sdll  used)  training  and  simulation 
applications  which  used  several  synchronized  screens  to  simulate  the  various  views  out  of 
an  airplane  cockpit  and  hydraulic  platforms  to  provide  tactile  feedback  belong  in  this 
category.  Other  application  examples  include  interactive  entertainment  systems  like  games 
and  music  creation  applications.  Also  at  this  level  would  be  included  some  of  the  remote 
sensing  and  robotic  applicadons  on  conventional  2-D  display  devices  that  use  shading  and 
perspective  to  give  the  illusion  of  3-  ot  4-D  images  like  architectural  CAD  applications. 
Finally,  many  data  visualization  and  imaging  applications  could  be  described  as  this  kind 
ofVR. 

The  second  level  or  type  involves  the  use  of  special  display,  user  tracking,  pointing 
and  tactile  feedback  technology  that,  for  the  most  part,  has  been  developed  especially  for 
VR.  The  current  state-of-the-art  example  would  be  a  set  of  “eyephones”  like  those 
developed  by  VPL  that  have  one  LCD  television  screen  for  each  eye,  the  images  for  which 
are  computer  processed  and  delivered  stereoscopically.  This  device  is  attached  to  the 
computer  via  an  umbilical  cord  which  either  trails  behind  the  user  or  is  linked  to  a  boom 
that  keeps  its  weight  and  cumbersomeness  at  a  minimum.  Also  attached  to  this  device  are 
a  special  tracking  device  (usually  magnetic  although  sometimes  ultrasonic)  that  tells  the 
computer  where  the  user  is  in  relation  to  the  electronic  space  and  a  set  of  headphones  for 
sound.  Although  binaural  sound  cues  for  simulating  3-D  sound  is  technically  pos.sible,  this 
embellishment  is  seldom  incorporated  into  VR  setups;  simple  2  channel  stereo  sound  is 
more  common.  Also  included  in  this  type  of  VR  is  some  kind  of  pointing  device,  with  or 
without  tactile  feedback.  One  of  the  most  common  devices  is  called  a  “data  glove”  which 
incorporates  another  tracking  device  to  tell  the  computer  where  the  user’s  hand  is  and  some 
fiber  optic  devices  that  tell  the  computer  the  positions  of  the  fingers  (i.e.,  whether  the  user 
is  pointing  a  finger  or  making  a  fist,  etc.).  Some  applications  at  this  level  also  include  non¬ 
media  equipment  to  provide  more  holistic  tactile  feedback  like  an  architectural  design 
system  at  the  University  of  North  Carolina  at  Chapel  Hill  that  uses  a  treadmill  wi;h 
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handlebars  in  place  of  the  data  glove  and  tracker  combination  to  allow  the  user  to  navigate 
through  electronic  buildings. 

Some  of  the  existing  applications  of  this  level  of  technology  include  medical 
imaging  for  planning  delicate  surgical  operations  and  for  focusing  and  aiming  radiation 
beams  for  cancer  treatment  There  are  also  a  varied  of  immersion  entertainment 
applications  which  are  still  in  the  development  stages.  One  example  of  the  latter  is  a  project 
reportedly  underway  in  Japan  to  create  a  VR  amusement  paric  where  participants  would  feel 
like  they  were  riding  a  rollercoaster.  Although  significant  and  important  advances  are  being 
made,  many  severe  limitations  to  applications  in  this  category  of  VR  exist  aside  from  the 
cost  of  most  of  these  devices,  which  is  astronomical.  These  would  include  the  need  for  an 
umbilical  cord  to  link  the  user  to  the  computer  system,  the  less-than-satisfactory  resolution 
of  the  display  devices  (e.g.,  the  VPL  eyephone  cannot  display  easily  readable  text),  the  lack 
of  processing  power  to  create/process  and  present  real-time  images,  realistic  tactile 
feedback  mechanisms,  high  speed  networks  to  deliver  data  from  distributed  databases  and/ 
or  to  deliver  signals  to  multiple  users  who  are  not  co-located,  and  database  and  network 
management  procedures  for  synchroniang  and  integrating  the  various  data  types. 

The  last  type  of  VR  represents  what  are  called  “volumetric”  display  technologies 
with  more  naturalistic  human-computer  interaction  capabilities.  In  the  display  area, 
volumetric  displays  tend  to  be  generated  using  laser  technology,  either  holographic  or 
where  a  computer  controls  a  laser  which  is  focused  on  pulsing  mirrors  or  rotating  helixes 
creating  a  3-  or  4-D  image  that  requires  no  paraphernalia  physically  attached  to  the  user. 
Further,  more  than  one  user  can  see  the  displayed  image.  Human-computer  interaction  in 
this  type  of  VR  also  frees  the  user  from  umbilical  connections  to  the  computer  with  voice 
recognition  interfaces  and  low-powered  laser  pointing  devices  aimed  at  a  light  sensitive 
display  surface  (for  menu  selections,  etc.).  Unlike  the  first  two  types  of  VR,  this  type  is 
making  use  of  very  different  tex^hnologies,  many  of  which  have  not  been  adopted  or  derived 
from  existing  technological  applications.  Suggested  applications  here  are  in  multi-user 
contexts  or  for  elaborate/hazardous  remote  control  situations  (e.g.,  NASA’s  full-parallax 
holographic  display  system  for  remote  operations).  One  possible  example  would  be  a 
multi-user  command,  control,  communication  and  intelligence  (C^l)  system  with  sub¬ 
systems  that  deal  with  user  training,  plaiming,  crisis  management  as  well  as  on  going 
tactical  command.  The  limitations  of  this  type  of  VR  arc  included  in  the  second  level  but 
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also  include  the  nascent  state  of  voice  recognition  technology,  providing  color  to 
holographic  and  laser  derived  images,  and,  more  obviously  than  in  the  other  levels,  system 
design  capability  because  of  the  complexity  of  envisioned  application  possibilities. 

Conceptually,  there  are  three  sources  of  images  for  the  visual  aspects  of  VR:  digital 
and/or  analog  representations  direa  from  the  physical  environment  (e.g.,  stereo  camera 
images),  images  derived  from  specific  data  points  and  streams  (e.g.,  much  of  the  work  done 
in  the  data  visualization  area),  and  artificial  data  (e.g.,  animations).  Each  of  these  sources 
will  ultimately  play  a  significant  role  in  creating  usable  VR  systems  and  each  of  them 
represent  significant  and  somewhat  unique  technical  problems  before  they  become  reliable 
for  envisioned  VR  applications. 

Most  of  the  work  currently  being  done  in  the  VR  area  focuses  on  the  development 
of  the  technology  itself  -  display  devices,  pointing  devices,  tactile  feedback,  image 
processing  capability  and  strategy,  data  visualization,  etc.  Almost  all  of  this  effort  is 
directed  towards  VR  emulations  of  actual  physical  reality.  It  is  very  important  to  note  that 
VR  presents  another  possibility  that  is  equally  compelling  and  perhaps  ultimately  much 
more  valuable. 

Human  beings  have  been  able  to  process  about  300  characters  per  second  (on  the 
average)  since  tlte  development  of  writing.  None  of  the  technological  innovations  in 
information  production,  dissemination  and  display  to  date  have  improved  on  tliat 
processing  rate  -  people  still  read  about  300  characters  per  second.  However,  reading  is  a 
unimodal  sensory  capability.  Researchers  in  communication,  perception,  etc.  know  that 
multimodal  capacities  are  m.ore  accurate  and,  depending  on  the  nature  of  the  sensory  input 
data,  capable  of  exceeding  the  bandwidth  of  any  single  sensory  channel.  For  example, 
human  beings  tend  to  “lead"  with  their  eyes  when  trying  to  identify  the  direction  a 
panicular  sound  is  coming  from  but  very  quickly  (and  often)  correlate  auditory  clues  with 
visual  clues.  Together,  these  two  sensory  modes  arc  more  accurate  than  either  one  of  them 
alone. 

If  we  take  this  understanding  and  add  it  to  the  potential  bandwidth  capability  of  VR 
technology  without  that  bandwidth  being  devoted  to  maintaining  a  physical  reality  analog, 
what  we  have  is  a  potential  for  organizing  and  representing  data  and  data  streams  such  that 
users  can  exceed  the  300  character  per  second  limit  The  only  example  of  this  that  I  have 
he?ir(l  about  is  the  rumored,  on  again  •  off  again  project  (currentlv  rumored  to  be  on  again) 
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involving  IBM  and  NYNEX  (among  others)  where  the  goal  is  to  transform  and  display  all 
the  information  and  data  that  a  Wall  Street  financial  analyst  needs  to  make  an  intelligent 
purchasing  decision  on  a  particular  stock.  The  idea  is  that  by  transforming  the  existing 
extensive  alphanumeric  data  streams  currently  seen  as  useful  in  making  such  a  decision  into 
multimodal  representations  in  a  four  dimensional  volume,  the  analyst  might  be  able  to 
make  sense  out  of  these  diverse  data  streams  faster  and  with  more  accuracy  than  is  currently 
possible.  To  some  researchers,  myself  included,  this  possibility  promises  ultimately  to  be 
much  more  valuable  than  the  current  emphasis  on  physical  reality  representations. 

PUTTING  VR  AND  APPLICATION-BASED  UTILITY  EVALUATION 
TOGETHER 

Given  that  current  system  design  and  system  evaluation  are  based  upon  technology- 
driven  and  expert-driven  approaches  (see  Taylor  1986  for  a  more  complete  discussion  of 
this  issue)  and,  given  that  the  current  VR  development  emphasis  is  following  in  this  trend, 
it  seems  unlikely  that  the  exploitation  of  VR  for  increasing  the  human-computer  interaction 
bandwidth  will  proceed  as  quickly  or  as  productively  as  it  might  by  a  more  coherent 
inclusion  of  the  cognitive  behaviors  associated  with  the  problem  that  the  system  is  intended 
to  solve  (i.e.,  a  more  comprehensive  acconunodation  for  the  application-based  utility  of  the 
various  system  hardware  and  software  features).  While  I  might  argue  that  the  exploitation 
of  VR  technology  for  the  current  physical  space  analog  applications  would  also  be  more 
efficient  and  effective  by  the  same  argument,  it  is  the  second,  and  largely  ignored, 
application  area  that  is  more  compelling. 

The  proposed  application-based  utility  approach  provides  several  conceptual, 
methodological  and  analytic  advantages  over  existing  system  design  and  evaluation 
approaches  which,  when  used  in  conjunction  with  the  existing  approaches  promises  to 
facilitate  the  timely  development  of  more  usable  systems  in  this  age  of  increasing 
complexity.  To  wit,  the  proposed  approach: 

•  focuses  much  more  coherently  on  the  problem  to  be  solved; 

•  systematically  employs  the  perceptions  of  system  users  in  knowledge  base 
development  and  in  system  configuration; 

•  provides  an  application/task  model  that  can  be  used  to  evaluate  the 
appropriateness  of  technological  system  features  (e.g.,  when  does  increased 


resolution  actually  help  the  user  do  what  s/he  needs  to  do); 

•  provides  evaluation  criteria  and  methods  that  address  gaps  in  more  conventional 
evaluation  approaches; 

•  draws  from  human-computer  interaction  perspectives  that  focus  directly  on  the 
cognitive  communication  behaviors  in  the  context  of  a  particular  problem; 

•  can  provide  for  a  significant  reduction  in  training  overhead;  and 

•  can  be  scaled  up  to  provide  design  insight  for  massive,  complex  systems. 


NOTES 

1.  If  indeed  it  was  ever  straightforward.  For  example,  one  can  look  around  any 

organization  and  see  countless  examples  of  $10,000.00  technical  solutions  to  10-cent 
problems  like  work  stations  with  high  resolution  graphics  displays  being  used 
exclusively  for  word  processing. 


2.  See  Newby,  Nilan  &  Duvall  1991  for  a  discussion  of  the  individual  differences 
approach  to  system  design. 
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OPTICAL  FIBER  AMPLIFIERS  AND  OSCILLATORS 


Kenneth  J.  Teegarden 
Salahuddin  Qazi 


l.lntrcxluction 

Optical  amplifiers  and  oscillators  based  on  fiber  waveguides  doped  with  luminescent 
impurities  have  been  studied  for  many  years.  Recently  the  advantages  of  in  line  op¬ 
tical  amplifiers  for  fiber  communication  systems  have  been  recognized  and  this  has 
led  to  intense  efforts  to  develop  amplifiers  for  the  wavelengths  where  commercial 
monomode  optical  fibers  have  the  lowest  loss  and  can  be  designed  to  have  zero 
dispersion,  namely  at  about  1.30  and  1.50  micro  'is.  The  most  successful  amplifier  to 
date  is  one  using  the  rare  earth  erbium  which  luminesces  at  1 .55  microns. 
Developmental  models  of  erbium  doped  fiber  amplifiers  which  achieve  small  signal 
gains  of  up  to  30db  and  power  amplification  of  13clbm  are  currently  on  the  market  and 
are  being  evaluated  in  the  field.  These  amplifiers  are  pumped  with  relatively 
expensive  strained  quantum  well  laser  diodes  operating  at  .980  or  1.48  microns.  An 
increase  in  available  pump  power  and  a  reduction  in  cost  could  be  achieved  if  efficient 
operation  at  a  pump  wavelength  of  .800  microns  was  possible  and  the  discovery  of  a 
glass  composition  which  would  eliminate  excited  state  absorption  of  this  pump 
wavelength  would  be  an  important  breakthrough.  Also,  an  increase  in  the  optical 
bandwidth  of  erbium  based  amplifiers  would  permit  the  advantages  of  wavelength 
multiptiplexing  to  be  more  fully  realized.  This  again  is  a  problem  whose  solution 
depends  on  the  development  of  glass  compositions  which  widen  the  gain  curve  of 
erbium,  even  at  the  expense  of  maximum  gain. 

Most  of  the  optical  fiber  embedded  today  operates  at  1 .3  microns  and  no  practical 
amplifier  based  on  silica  fibers  has  as  yet  been  developed  for  that  wavelength 
although  some  promising  systems  based  on  neodymium  in  non  silica  glasses  have 
been  discovered.  The  search  for  a  1 .30  micron  fiber  amplifier  continues  to  be 
vigorously  pursued  in  several  laboratories  with  emphasis  being  placed  on  the 
discovery  of  new  dopants  for  silica  fibers  with  transitions  at  1.3  microns. 

Recently,  erbium  doped  fibers  have  been  employed  as  optical  oscillators,  or  lasers,  as 
well.  In  particular,  they  have  been  used  to  construct  tunable,  mode  locked,  traveling 
wave  ring  lasers  that  produce  trains  of  50  ps  wide  pulses  with  repetition  rates  of 
6GHz.  Such  oscillators  are  proposed  for  use  in  very  high  band  pass  digital 
communications  systems  where  they  would  replace  solid  state  diode  lasers  as  the 
primary  signal  source  for  all-fiber  communications  systems.  The  application  of  fiber 
ring  lasers  to  communications  systems  is  not  as  advanced  as  is  the  case  for  fiber 
amplifiers.  No  developmental  models  are  currently  available,  indicating  that  the 
parameters  for  stable  operation  of  the  device  have  not  as  yet  been  established.  It  is 
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clear,  however,  that  the  same  materials  problems  exist  for  the  oscillator  as  exist  for  the 
amplifier.  There  is  a  need  for  dopants  and  glass  compositions  that  increase  tuning 
range  and  permit  operation  at  other  wavelengths,  such  as  1.3  microns. 

The  work  described  herein  was  carried  out  in  the  Air  Force  Photonics  Center,  Rome 
Laboratories,  Griffiss  Air  Force  Base,  New  York.  Its  objective  was  to  establish  in  house 
expertise  in  the  area  of  fiber  based  luminescent  amplifiers  and  oscillators  as  a 
foundation  forsubsequent  efforts  to  develop  new  devices  and  incorporate  them  into 
communications  systems  of  interest  to  the  Air  Force.  The  specific  project,  undertaken 
during  the  summer  of  1991 ,  was  the  construction  and  characterization  of  a  state  of  the 
art  erbium  doped  fiber  amplifier  using  commercially  available  components. 

II.  Experimental 

The  experimental  set-up  used  in  most  of  the  measurements  is  shown  in  figure  1 .  The 
erbium  fiber  used  as  the  gain  medium  was  supplied  by  Dr.  W.  J.  Miniscalco,  GTE 
Laboratories.  It  had  a  core  diameter  of  8.0  microns  and  an  outer  diameter  of  100 
microns.  A  commercial  multiplexer  (JDS  FITEL  Inc.,  model  #  WD915F-9S/N  Y53)  was 
used  to  combine  an  optical  signal  at  1.5275  microns  with  pump  light  from  various 
sources.  The  single  mode  optical  signal  was  generated  by  a  distributed  feedback  laser 
in  a  commercial  lightwave  transmitter  (Lasertron  model  #  QLX-IOOO).  Preliminary  data 
using  a  frequency  doubled  Nd'.YAG  laser  (ADLS  Model  #  300)  operating  at  .530 
microns  as  a  pump  source  was  obtained,  but  only  data  using  a  strained  quantum  well 
laser  (  Spectra  Diode  Labs  Model  #  S9075 )  operating  at  .980  microns  as  pump  is 
includ  J  in  this  report.  Commercial  single  mode  fiber  patch  cords  designed  to  operate 
at  1.3  microns  and  terminated  with  FC/PC  connectors  were  fusion  spliced  onto  both 
ends  of  the  amplifier  fiber,  and  onto  the  input  end  and  the  signal  output  end  of  a 
wavelength  demultiplexer  (Gould,  Inc.  model  #  980-1550-CXW-MX-02x02-01).  Similar 
fibers  with  connectors  were  fusion  spliced  onto  the  ou^ut  and  signal  input  of  the 
multiplexer,  while  the  pump  input  end  of  this  device  was  left  bare,  carefully  cleaved 
and  mounted  in  an  xyz  positioner.  Light  from  the  pump  sources  was  collimated  when 
necessary  and  focused  onto  the  core  of  the  multiplexer  input  with  appropriate 
microscope  objectives.  Although  the  connectors  used  in  this  set  up  represented 
avoidable  losses,  they  were  used  so  that  measurements  of  optical  power  could  be 
made  at  various  points  in  the  fiber  system.  In  a  real  amplifier  they  would  have  been 
replaced  by  fusion  splices. 

The  overall  gain  of  the  amplifier  and  its  emission  spectrum  were  measured  as  a 
function  of  pump  power  and  signal  power  for  the  pump  wavelengths  given  above.  The 
emission  spectrum  was  determined  using  an  optical  spectrum  analyzer.  The  overall 
gain  of  the  system  was  measured  in  two  ways:  either  the  peak  output  power  was 
measured  on  the  spectrum  analyser  for  a  measured  peak  input  power,  or  the  lightwave 
transmitter  was  amplitude  modulated  with  a  33  MHz  sinusoidal  RF  signal  and  the 
resulting  AC  input  and  outpur  signals  measured  with  a  commercial  lightwave  receiver 
(Lasertron  model  tf  QRX-700)  and  an  oscilloscope,  in  both  cases  the  input  signal  was 
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measured  at  the  output  end  of  the  demuiltiplexer,  thus  losses  in  this  device  were 
excluded  from  the  determination  of  overall  gain.  Measurements  of  pump  power 
injected  into  the  amplifier  ware  made  with  a  Newport  power  meter  ( model  #  835), 
again  at  the  output  end  of  the  demultiplexer. 

A  determination  of  the  gain  of  the  doped  fiber  alone  was  made  by  separately 
measuring  the  losses  at  1.55  urn  which  occured  in  the  demultiplexer,  connectors  and 
fusion  splices  and  correcting  the  overall  gain  for  these  losses.  A  Fotec  power  meter 
was  used  for  these  measurements. 

III.  Results 

A.  Emission  Spectrum  ’ 

The  emission  spectrum  of  the  erbium  doped  fiber  pumped  at  .980  microns  with  20  mw. 
of  power  is  shown  in  figure  2.  This  spectrum  is  identical  to  data  published  in  the 
literature  to  within  the  calibration  of  the  spectrum  analyzer  used,  except  for  a  periodic 
ripple  which  is  superimposed  on  the  more  slowly  varying  envelope  of  the 
fluorescence.  The  magnitude  of  this  ripple  and  the  shape  of  the  emission  curve  were 
dependent  on  pump  power  and  the  exact  way  the  pump  radiation  was  launched.  For 
example,  if  the  end  of  the  input  fiber  to  the  multiplexer  was  moved  very  slightly  from  the 
/position  corresponding  to  figure  2.,  the  spectrum  shown  in  figure  3a.  was  obtained. 
Here  the  ripple  is  much  more  pronounced.  The  ripple  moved  slowly  in  time  relative  to 
the  emission  band  and  this  gave  rise  to  periodic  fluctuations  in  the  gain  of  the 
amplifier. 

If  the  connectors  in  the  systems  were  loosened  and  held  in  the  right  position,  even 
more  drastic  changes  in  the  emission  spectrum  occured.  This  is  illustrated  in  figure 
3b.,  where  the  emission  at  peak  of  the  spectrum  appears  greatly  enhanced  as  well  as 
the  ripple.  In  fact  at  maximum  pump  power,  obvious  lasing  action  could  be  induced  as 
can  be  seen  from  figure  4.  These  effects  most  likely  ocurred  because  of  feedback  from 
reflections  at  coupler  inte^aces  or  fiber  ends  and  could  be  minimized  by  careful 
allignment  and  tight  couplers.  It  should  be  noted,  however,  that  because  they 
represent  oscillations  in  the  gain  curve  they  cause  instablities  in  the  gain  of  the 
amplifier.  An  improvement  in  amplifier  operation  would  have  occured  if  optical 
isolators  were  included  in  the  amplifier  design,  and  this  is  planned  in  future  work. 

B.  Amplifier  Gain 

The  overall  gain  of  an  amplifier  based  on  a  4.75  meter  length  of  erbium  fiber  was 
measured  using  the  first  method  described  above.  Figure  5  illustrates  how  this 
measurement  was  made.  Figure  5a  shows  the  signal  laser  power  at  the  output  of  the 
multiplexer  after  attenuation  with  a  variable  optical  attenuator  and  figure  5b  gives  the 
amplified  output  for  a  pump  power  of  20  milliwatts  at  .980  microns.  Note  that  the  peak 
of  the  gain  curve  for  erbium  lies  at  about  1.5303  microns  so  that  the  signal  in  this  case 
did  not  coincide  with  tiie  wavelength  for  maximum  gain.  The  values  of  gain  reported 
here  are  therefore  not  peak  values  but  are  about  one  half  the  gain  expected  at  1.5303 
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microns.  The  overall  gain  calculated  from  the  ratio  of  the  output  to  the  input  signal  in 
this  case  is  9.0  or  9.5  db.  The  overall  gain  of  a  4.75  meter  long  fiber  measured  in  this 
way  as  a  function  of  launched  .980  micron  pump  power  for  an  input  signal  power  of  30 
nw  is  shown  in  figure  6. 

The  transmission  of  the  4.75  meter  fiber  at  .980  microns  as  a  function  of  launched 
pump  power  ( power  at  the  out  put  of  the  multiplexer)  is  shown  in  figure  8.  Clear  signs 
of  saturation  are  apparent  in  this  figure.  That  is,  the  fiber  was  more  transparent  at  a 
launched  power  of  20  mw.  than  at  a  power  of  1.0  mw.  The  data  shown  in  figure  7.  were 
used  to  correct  the  data  in  figure  6.  and  generate  a  plot  of  the  overall  gain  of  the 
amplifier  as  a  function  of  absorbed  pump  power  rather  than  launched  pump  power. 

The  result  is  shown  in  figure  8. 

The  data  shown  in  figures  6.  and  7.  indicate  that  a  higher  gain  would  be  obtained  by 
using  a  fiber  of  longer  than  4.75  meters.  At  20  mw  of  pump  power,  lOmw  was 
transmitted  and  this  .0  mw  would  produce  a  gain  of  about  4.0  in  a  fiber  of  comparable 
length.  Therefore,  an  additional  three  meters  of  erbium  doped  fiber  was  fusion  spliced 
to  FC\PC  connedcrs  and  coupled  to  the  original  4.75  meters  of  fiber.  Measurements  of 
overall  gain  were  made  on  this  7.75  meter  fiber  using  the  second  method  described 
above  in  which  the  signal  laser  was  amplitude  modulated  with  a  33MHz  signal.  The 
results  of  these  measurements  are  shown  in  figure  9,  where  overall  gain  is  given  as  a 
function  of  pump  power  and  relative  signal  power  for  an  input  signal  modulation  of  1.0 
mV.  peak  to  peak.  A  strong  dependence  on  launched  pump  power  and  considerable 
saturation  at  maximum  signal  power  are  illustrated  by  this  data.  As  predicted,  this 
greater  length  of  fiber  showed  a  higher  gain  even  though  the  additional  couplers  and 
fusion  splices  introduced  additional  losses. 

Finally,  the  gain  of  the  doped  fiber  itself  was  estimated  by  correcting  the  maximum 
overall  gain  obtained  above  for  losses  incured  in  the  FC/PC  bulkhead  connectors  and 
the  two  fusion  splices  beteen  the  standard  .size  fiber  patch  cords  and  the  erbium  fiber. 
Losses  which  occured  in  the  fusion  splices  because  of  missmatch  between  core 
diameters  were  by  fa»'  the  largest  and  were  found  to  be  2.7  db  for  the  two  splices  in 
series.  The  measured  losses  in  the  connectors  was  about  .05  db  per  coupler  on  the 
average.  Since  there  were  three  couplers  involved,  the  total  loss  amounted  to  about 
2.9  db.  Thus  the  peak  fiber  gain  must  have  been  about  14  db.  at  a  launched  pump 
power  of  20  mw.  and  a  pump  wavelength  of  .980  microns.  It  should  be  noted  however, 
that  the  fiber  core  missmatch  in  the  fusion  splices  occured  because  the  core  of  the 
doped  fiber  is  made  smaller  than  the  standard  size  of  10  microns  inorder  to  increase 
pump  power  density  in  the  amplifying  fiber.  Thus  a  major  loss  in  this  system  is  inherent 
in  the  way  the  fiber  Is  aesigned.  Also,  in  an  operational  system  losses  in  both 
multiplexer  and  dv^multiplexer  must  be  included  in  a  specification  of  overall  amplifier 
gain.  These  were  found  to  total  3.2  db  at  the  signal  wave  length  of  1 .5275  microns. 
Finally,  v/e  noted  above  ttjat  the  wavelength  of  the  signal  did  not  match  the  peak  of  the 
erbium  gain  curve.  In  fact  the  peak  gain  of  the  fiber  was  about  twice  that  of  the  gain 
measured  at  the  signal  wavelength,  that  is  50  or  17  db. 
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IV  Summary 

This  project  illustrated  somr,  of  the  basic  problems  to  be  considered  in  the  construction 
of  an  erbium  fiber  based  optical  amplifier.  The  gain  of  the  amplifier  was  strongly 
dependent  on  the  launched  pump  power  and  the  efficiency  with  which  pump  radiation 
Is  injected  into  the  fiber  Is  therefore  of  critical  importance.  A  pig  tailed  pump  laser  might 
have  increased  the  pump  power  available  in  the  present  Investigation.  The 
dependence  of  the  gain  on  fiber  length  is  also  an  important  factor  in  amplifier  design, 
but  was  only  briefly  investigated  in  this  work.  The  major  losses  which  placed  a 
limitation  on  the  overall  gain  of  the  amplifier  were  found  to  lie  in  the  core  missmatch 
between  doped  fiber  and  standard  fiber.  One  way  to  avoid  this  problem  would  be  to 
increase  the  core  diameter  of  the  doped  fiber  to  10  microns  and  offset  the  reduction  of 
pump  power  density  by  inaeasing  launched  pump  power.  Since  pump  lasers  are 
continually  being  improved,  mis  is  probably  not  an  unreasonable  option.  Losses  in 
the  wavelength  multiplexer  and  demultiplexer  used  were  significant,  but  may  be 
reduced  in  future  models  of  these  devices.  Finally,  If  higher  gains  are  to  be  obtained  in 
the  amplifier  constructed  in  this  project,  inclusion  of  optical  Isolators  to  reduce  internal 
feedback  and  resultant  instabilities  is  of  critical  importance. 


10-6 


OJ^ctlic^s  c 


10-7 


10-8 


oocrr-Toi  iM 

■  •  VW*  • 

!>••  •••  ■ 


o<  oo  a£  «^»o' 

WR  1 


5oen 


3 


1 . 4r7'5  Mill 


t  5Z75 


1.5775 


RES'  0  •  5nm 


10rift  div 


i 


.^Onw-'di'.* 


o  1  ao  1  A  • 

••  •  ♦  ww  •  ww  *  .  •  ••  W 


0 


I  .^rr5 


■►■'m 


l;--V 


r4«ii 


—  c 


X  •  .*4-  I 


10-9 


10-10 


SPECTRUM 
58n  ••• 


5nw/di.w 


91.07.30  17:11 
AUR  1 


n-- 

.  I 

1  r- . 

i 

j  ! 
i  \ 


1 . 5225  1 . 5275 


1.5325 


RES  0.5nm 


(/IK. 5% 


Inm/div 


SPECTRUM 

500n 


50nw/div^ 


91.07.30  17:09 
^AUR  1 


(J 


1-5225  Him  1-5275  1-5325 


RES  “■.n.M 


Inm/div 


10-11 


GAIN 


Figure  6 


10-12 


transmission 


Figure  7 


10-13 


10 


Figure  8 


10-14 


0 


Collecting  Data  for  Markov  Models  of  Error 
Patterns  on  Data  Coininunications  Links 

Wayne  D.  Smith 

ABSTRACT;  Rome  Laboratory  is  currently  in  the  process  of 
procuring  Error  Injector  Units  (EIU)  to  model  the  error 
behavior  of  data  communications  channels.  A  major 
component  of  this  effort  deals  with  finding  Markov  models 
that  will  simulate  the  behavior  of  these  channels.  The 
objectives  of  this  effort  were  to  continue  the  research 
initiated  in  1990  to  find  Markov  tables  to  load  into  the 
EIUs.  A  portion  of  the  research  involved  searching  current 
literature  for  Markov  tables  or  error  gap  data.  Alternative 
approaches  to  obtaining  tho  desired  Markov  tables  were  also 
considered.  This  included  research  into  the  use  of 
simulation  models  to  produce  Markov  tables,  and  some  further 
work  on  the  concept  of  "heuristic  error  models."  The  most 
substantial  effort  was  devoted  to  the  design  of  a  data 
collection  experiment  that  could  be  used  at  Rome  Laboratory 
to  produce  the  needed  data.  This  effort  included  the  design 
of  a  microcomputer  interface  that  would  pemmit  a 
microcomputer  to  monitor  a  "Fireberd"  bit  error  rate  tester 
and  record  the  error  statistics  for  analysis  and  conversion 
to  a  Markov  table. 
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I.  INTRODUCTION: 

Rome  Laboratory  is  in  the  process  of  procuring 
specialized  computer  hardware  that  can  be  used  to  model  the 
error  behavior  of  a  number  of  different  data  communications 
channels  [1].  The  hardware  can  be  placed  in  line  in  any 
communications  link,  and  can  be  used  to  simulate  the 
occurrence  of  errors  on  that  link. 

As  each  bit  is  received  at  the  EIU  input,  the  logic  of 
the  unit  determines  whether  or  not  an  error  is  to  be 
generated  for  this  bit.  If  an  error  is  to  be  generated,  the 
incoming  bit  is  exclusive  ored  (EXOR)  with  a  one  bit 
produced  from  within  the  EIU  board.  This  operation  results 
in  an  inversion  of  the  incoming  bit  at  the  output  port  of 
the  EIU. 

The  EIUs  are  based  on  a  Markov  chain  that  models  the 
behavior  of  the  data  communication  links.  The  model  is 
internally  represented  with  an  n  state  (n<=8)  Markov  model, 
and  transition  between  states  is  controlled  by  an  8  x  8 
matrix,  called  the  Markov  table.  A  random  number  generated 
within  the  EIU  circuitry  determines  the  next  state  of  the 
model . 

The  Markov  tables  can  be  down  loaded  to  the  EIU  from  a 
SUN  controller,  or  entered  directly  from  the  EIU 
microcomputer  keyboard.  Once  the  table  has  been  loaded,  the 
EIU  will  model  the  error  behavior  of  the  link  being  tested 
as  determined  by  the  Markov  model.  A  major  component  of  the 
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EIU  project  is  concerned  with  finding  Markov  table  values 
that  will  produce  an  error  bit  string  that  will  mimic  the 
behavior  of  various  data  communications  links. 

It  is  well  known  that  ctata  errors  tend  to  occur  in 
clusters  or  "bursts"  that  are  divided  by  relatively  long 
periods  without  errors,  called  gaps.  Any  model  of  data 
errors  must  take  this  "bursty"  nature  of  real  errors  into 
consideration.  In  order  to  characterize  data  errors  in 
general,  it  is  convenient  to  define  a  simple  data 
communications  channel  model  as  shown  in  Figure  1. 


y,  =  X,  +  e, 


Figure  1;  Data  Communications  Model 

In  the  model,  (x)  represents  the  bit  string  of  the 
transmitted  data.  The  channel  corrupts  the  incoming  data 
with  interference  and  produces  the  output  data  string  (y). 
The  string  {e}  is  the  error  string.  A  one  in  the  string  e 
represents  an  error,  while  a  zero  represents  the  absence  of 
an  error.  The  "+"  operator  represents  modulo  2  addition  or 
an  exclusive  or  operation. 
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The  most  convenient  way  to  characterize  the  error 
distribution  involves  specifying  the  gap  distribution  of  the 
data.  A  gap  is  the  opposite  of  an  error  burst,  and  is 
represented  in  the  error  string,  (e),  as  a  sequence  of 
zeroes  in  the  string.  The  gap  distribution  is  a  measure  of 
the  number  of  times  that  a  gap  of  length  m  occurs  in  a 
particular  error  string.  By  definition,  a  gap  is  preceded 
by  an  error,  so  the  gap  error  probability  is  written  as: 
P(0"’|1). 

Gilbert  [3]  showed  that  the  error  performance  of  a  data 
link  can  be  characterized  by  a  two  state  Markov  model,  where 
one  state  is  the  one  that  may  generate  an  error  (B) ,  and  the 
other  state  is  error  free  (G) .  Discrete  probabilities  are 
associated  with  the  generation  of  an  error  in  the  B  state, 
and  also  with  the  likelihood  of  a  transfer  from  one  state  to 
the  other. 

Fritchman  [4]  extended  the  Gilbert  model  to  an  N  state 
model,  with  k  error  free  states,  and  N-k  error  states.  He 
also  showed  that  an  N  state  model  with  a  single  error  state, 
and  N“1  non-error  states  is  sufficient  to  represent  the  data 
distribution  from  a  data  communications  link.  State 
transitions  in  the  model  are  governed  by  a  Markov  table. 

In  striving  to  model  the  error  performance  of  a  data 
communications  link,  the  goal  is  to  discover  a  Markov  model 
that  will  produce  an  error  sequence  with  the  same 
distribution  as  the  observed  error  data.  A  number  of 
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researchers  have  collected  data  on  various  data 
communication  link  error  characteristics,  and  have  used  this 
data  to  define  the  structure  of  Markov  chains  that  would 
reproduce  the  observed  error  distribution  [4,  5,  6,  7,  8]. 
Varsbney  [9]  has  collected  much  of  this  data  in  an  RADC 
Technical  Report. 

II.  DIECJSSION  OF  THE  PROBLEM: 

The  objectives  of  the  research  effort  in  the  summer  of 
1991  were  to  continue  the  work  initiated  in  1990  to  find  the 
Markov  tables  needed  to  permit  the  EIUs  to  model  the 
characteristics  of  as  many  di f ferent  data  channels  as 
possible.  This  effort  included  additional  literature  review 
for  any  previous  research  that  had  produced  Markov  models 
that  were  suitable  for  use  with  the  EIUs.  Some  effort  was 
also  devoted  to  a  literature  search  for  error  data  that  was 
not  already  in  Markov  table  form,  but  that  was  suitable  for 
conversion  to  a  Markov  model. 

As  an  adjunct  to  the  literature  review,  alternative 
methods  for  obtaining  the  Markov  tables  were  investigated. 
This  included  some  work  on  the  heuristic  models  started  last 
summer,  as  well  as  the  potential  use  of  simulation  models  of 
data  communications  links  to  provide  gap  data  that  could  be 
reduced  to  the  Markov  table  format. 

Uith  the  realization  that  these  methods  would  probably 
not  produce  the  quantity  and  type  of  error  data  needed  for 
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the  EIUs,  the  major  effort  was  devoted  to  finalizing  the 
design  of  an  experiment  to  collect  the  needed  data  at  Rome 
Laboratory.  'T’his  design  included  the  development  of 
experiment  procedures,  the  design  of  a  computer  interface 
between  a  microcomputer  and  a  bit  error  rate  tester,  and  the 
initial  specification  for  the  software  needed  in  the 
microcomputer  to  collect  the  needed  data. 

Finally,  the  facilities  at  the  University  of 
Mississippi  that  are  used  to  reduce  the  gap  data  to  Markov 
tables  was  tested,  and  the  access  procedures  from  Rome 
Laboratory  were  developed.  The  reduction  of  gap  data  from  a 
Markov  table  computer  simulation  program  was  used  as  a  test 
case. 


III.  RESULTS; 

The  literature  review  was  unsuccessful  in  locating  any 
additional  Markov  tables  for  use  in  the  EIUs.  Although  the 
search  continues,  it  appears  unlikely  that  any  new  models 
will  be  discovered. 

In  the  study  of  the  use  of  simulators  of  various  types 

t 

to  produce  Markov  tables,  two  papers  devoted  to  this  subject 
were  discovered.  One  of  these  involved  an  analog  simulation 
of  the  Rayleigh  fading  model  [11].  The  second  was  a  digital 
computer  simulation  of  the  same  model  with  similar 
parameters  [12].  Together,  these  two  papers  provided 
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several  additional  Markov  tables,  albeit  several  levels 
removed  from  real  world  channel  error  performance. 

In  spite  of  almost  one  year  of  ext2nsive  literature 
review,  including  computer  data  base  searches,  to  date  very 
little  data  on  communication  channel  errors  has  been 
uncovered.  The  data  that  has  been  found  is  relatively  old, 
and  represents  equipment  that  is  no  longer  applicable  the 
data  communications  needs  of  the  U.S.  Air  Force. 

One  reason  for  the  lack  of  success  in  this  search  deals 
with  the  format  of  error  data  required  by  the  EIU  project. 
The  EIU  is  a  far  more  sophisticated  device  than  any  that 
have  been  used  to  emulate  data  errors  in  the  past.  The  EIU 
emulates  not  only  the  error  rate,  but  the  distribution  of 
errors  within  bursts.  This  produces  models  with  much  higher 
fidelity  than  other  types  that  tend  to  use  an  independent 
error  assumption. 

This  higher  fidelity  is  not  without  cost,  however.  In 
order  to  construct  a  Markov  table  that  models  the 
communication  errors  requires  complete  information  about  the 
error  distribution.  This  means  that  the  error  data  must  be 
collected  in  detail.  Just  counting  the  number  of  bits 
received  and  the  number  of  errors  is  not  adequate.  Data 
with  this  level  of  detail  is  more  difficult  to  collect,  and 
requires  more  storage  capability  than  just  collecting  error 
rates.  Hence,  the  amount  of  information  suitable  for 
conversion  to  Markov  tables  is  in  rather  short  supply. 
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As  a  result  of  the  lack  of  success  in  finding  either 
Markov  tables  or  appropriate  data  for  use  in  producing  such 
tables,  significant  effort  was  invested  in  the  design  of  an 
experiment  to  collect  the  needed  data  at  RL.  The  first 
draft  of  a  proposed  design  was  presented  in  a  "white  paper" 
during  a  visit  in  March  of  1991.  As  this  paper  met  with 
some  acceptance,  additional  effort  was  devoted  to  the 
refining  of  the  design  of  an  experiment  to  collect  the 
needed  data  on-site  at  Rome  Laboratory. 

To  perform  the  required  data  collection,  it  is 
necessary  to  set  up  a  transmitter  at  one  location  and  a 
receiver  at  another.  The  transmitter  would  transmit  a 
series  of  random  data  bits  over  the  communications  channel. 
At  the  other  end,  the  receiver  would  compare  the  data  bits 
received  with  the  bits  actually  transmitted,  and  record  the 
occurrence  of  errors.  For  the  purposes  of  the  initial 
experiment,  no  error  detection  or  correction  techniques 
would  be  used  with  the  data. 

At  the  recei\ _ng  end,  the  receiver  would  have  to 
generate  a  se^^aence  of  random  bits  identical  to  that  used  by 
the  transmitter.  The  receiver  must  compare  these  bits  to 
the  incoming  bits  and  detect  errors  within  the  data  stream. 
The  receiver  must  also  record  the  errors  that  are  detected. 
This  task  will  require  a  mass  storage  device  for  holding  the 
collected  data. 
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Figure  2  shows  a  block  diagram  of  the  equipment  needed 
to  set  up  the  data  collection  experiment.  The  blocks  on  the 
left  represent  the  transmitter  and  those  on  the  right  are 
the  receiver. 


Figure  2;  Data  Collection  Experiment  Hardware 


An  essential  piece  of  equipment  for  use  in  this 
experiment  is  the  Fireberd  6000  (or  equivalent)  Bit  Error 
Rate  Tester  (BERT) .  This  unit  can  be  programmed  generate 
several  different  types  of  random  bit  sequences  for  use  in 
the  data  collection  experiment.  By  using  two  such  units, 
one  at  the  transmitter,  and  one  at  the  receiver,  the 
generation  and  comparison  of  random  data  bits  becomes 
relatively  simple.  When  first  initiated,  the  receiving  BERT 
will  synchronize  itself  and  lock  on  to  the  bit  sequence  from 
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the  transmitter.  It  will  remain  in  synchronization  until  a 
loss  of  signal  or  extremely  long  error  burst  results  on  a 
loss  of  synchronization. 

By  connecting  the  BERT  units  to  a  transmitter  and 
receiver,  some  error  statistics  pertaining  to  the  data 
channel  can  be  collected  directly  by  the  BERT.  These 
statistics  include  error  counts,  bit  error  rates,  block 
error  rates,  etc.  Unfortunately,  the  statistics  collected 
by  the  BERT  are  not  sufficiently  detailed  for  the 
construction  of  Markov  tables. 

In  order  to  collect  the  error  gap  data  needed  for  the 
construction  of  the  Markov  tables,  a  computer  must  be 
included  in  the  system.  Ihe  BERT  unit  provides  a  number 
output  signals  that  will  permit  the  interfacing  of  the  BERT 
to  a  microcomputer  dedicated  to  collecting  the  error  gap 
data.  These  signals  include  a  loss  of  synchronization  (LOS) 
signal,  a  recovered  clock  (CK)  signal  and  an  error  (ERR)  bit 
signal. 

In  essence,  the  error  collection  system  would  work  by 
reading  these  status  bits  from  the  BERT.  Each  time  that  the 
CK  signal  is  received  (without  ERR) ,  the  gap  size  count  is 
increased  by  one.  When  an  ERR  signal  is  received  along  with 
the  CK  signal,  the  gap  size  count  is  stopped,  and  a  value 
representing  the  number  of  times  a  gap  of  length  equal  to 
the  current  gap  size  count  is  incremented  by  one.  The  gap 
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size  counter  is  then  reset  to  zero,  and  the  process 
continues. 

Gap  count  data  is  accumulated  in  one  hour  blocks  and 
then  transferred  to  disk  for  permanent  storage.  By  using 
this  approach,  in  the  event  of  a  test  failure,  all  but  the 
last  hour  of  data  would  be  valid  and  available  for  use.  In 
testing  the  status  bits,  the  loss  of  synchronization  signal 
is  always  tested  first.  LOS  represents  such  a  catastrophic 
error  that  the  experiment  is  terminated  when  that  signal  is 
asserted. 

The  major  limitation  to  this  approach  is  the  relatively 
slow  processing  speed  of  the  microcomputer.  When  two  or 
more  errors  occur  in  succession,  the  microcomputer  may  not 
be  able  to  process  the  gap  count  data  as  fast  as  it  is 
generated.  For  example,  with  a  data  rate  of  1.544  Mbps,  the 
time  between  the  arrival  of  two  successive  error  bits  is 
about  650  nanoseconds.  In  that  time  period,  the  pc  can 
perform  only  about  four  instructions.  Even  with  interrupt 
driven  software,  this  speed  is  inadequate  to  process  the 
data. 

For  this  reason,  an  interface  between  the  BERT  and  the 
microcomputer  is  necessary.  A  block  diagram  of  this 
Interface  is  shown  in  Figure  3.  The  interface  consists 
primarily  of  a  32  bit  counter  and  a  512  x  36  bit  (only  32 
bits  are  used)  FIFO  storage  buffer.  The  counter  is  driven 
directly  from  the  recovered  clock  from  the  BERT.  Each  time 
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a  bit  is  received  by  the  BERT,  the  counter  is  incremented  by 
one.  This  counter  represents  the  size  of  the  current  gap. 


Figure  3:  Computer  Interface  Block  Diagram 

When  an  ERR  signal  is  received  by  the  interface,  this 
triggers  two  operations.  First,  the  current  value  of  the 
counter  is  written  into  the  FIFO  store  for  later  retrieval 
by  the  microcomputer.  Immediately  thereafter,  the  value  in 
the  counter  is  cleared  in  preparation  for  beginning  the  next 
gap  count  sequence.  All  these  actions  are  effected  with  a 
single  invertor  connected  to  the  ERR  signal,  a  silicon  delay 
line,  and  a  monostable  multivibrator  that  generates  the 
counter  clear  strobe. 

The  FIFO  queue  is  a  dual-ported  RAM  memory  with 
integrated  queue  pointers.  It  contains  internal  circuitry 
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to  provide  a  full  (FF) ,  empty  (EF)  and  half-full  (HF)  flags. 
The  CPU  reads  these  flags  as  part  of  the  data  collection 
software.  If  EF  is  asserted,  the  CPU  need  do  nothing  and 
simply  remains  in  the  status  read  loop.  If  EF  is  negated, 
then  the  CPU  must  read  data  from  the  FIFO  and  move  it  into 
memory.  Because  the  interface  circuitry  counts  before 
writing,  a  count  value  of  n  in  the  register  represents  a  gap 
of  length  n-1.  The  software  makes  an  appropriate  adjustment 
to  the  input  data,  either  as  it  is  read,  or  before  updating 
the  gap  statistics. 

The  FIFO  queue  will  permit  collection  of  data  as  long 
as  an  error  burst  of  length  greater  than  512  bits  does  not 
occur  before  the  computer  can  read  the  data.  Should  this 
happen,  the  queue  may  become  full,  and  data  could  be  lost. 
This  is  extremely  unlikely  with  any  reasonable  data  error 
rates.  Should  the  FIFO  become  full,  it  would  be  treated  as 
a  catastrophic  failure  of  the  channel,  and  the  test 
discontinued. 

The  major  difficulty  in  designing  this  interface 
circuit  deals  with  the  timing  of  the  various  operations 
within  the  unit.  For  example,  the  write  to  the  FIFO  must 
not  take  place  while  the  counter  is  being  incremented.  Nor 
should  the  counter  be  cleared  before  the  write  operation 
(from  the  counter)  to  the  FIFO  queue  has  been  completed. 
These  criterion  are  shown  in  Figure  3.  This  figure  includes 
the  minimum  timing  requirements  imposed  by  the  various 
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components  in  the  system,  and  also  shows  the  timing  values 
achieved  by  the  interface  circuit. 


The  interface  circuit  is  reasonably  simple,  and  is 
constructed  using  wire  wrapped  circuitry  on  a  JDR 
Microdevices  brand  prototyping  card  that  includes  address 
decoding.  The  invertor  (74LS04)  inverts  the  positive  ERR 
signal  to  provide  the  active  low  W/  signal  required  to  write 
to  the  FIFO.  The  silicon  delay  circuit  (DS  1000-175) 
provides  a  105  nanosecond  delay  to  insure  that  the  counter 
has  stabilized  before  the  write  takes  place  (data  set  up 
time) .  The  rising  edge  of  the  W/  signal  triggers  the 
monostable  multiviabrator  (74LS122)  to  produce  an 
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approximately  60  nanosecond  low  pulse  to  clear  the  32  bit 
counter  after  the  write  has  been  completed. 

At  1.544  Mbps  data  rates,  the  minimum  arrival  time 
between  error  pulses  could  be  as  short  as  650  nanoseconds. 
All  the  operations  noted  above  are  completed  within  this 
time  frame  and  before  the  next  CK  and/or  ERR  signals  arrive. 
The  average  time  between  arrivals  of  error  bits  (i.e.  the 
average  gap  length)  is  the  reciprocal  of  the  bit  error  rate 
(BER)  of  the  channel  being  tested,  and  sequences  of  errors 
longer  than  10  bits  would  be  extremely  rare.  The  FIFO  store 
is  capable  of  handling  at  least  512  errors  in  sequence 
without  losing  data.  Should  that  prove  to  be  inadequate, 
pin  compatible  FIFO  stores  with  up  to  8K  storage  locations 
are  commercially  available  at  a  slightly  increased  cost. 

The  software  for  this  experiment  is  also  relatively 
straight  forward.  The  microcomputer  reads  the  status  port 
to  determine  if  the  FIFO  contains  data.  If  so,  the  FIFO  is 
read  and  the  gap  statistics  are  updated  based  on  the  value 
read.  The  computer  then  returns  to  the  loop  of  reading  the 
status  port.  If  the  status  port  indicates  that  the  FIFO  is 
empty,  the  computer  continues  ro  read  the  port  until  data  is 
available. 

Approximately  once  per  hour,  the  CPU  will  write  the 
accumulated  error  data  to  disk.  To  insure  a  complete 
record,  this  write  operation  is  perforr-'.d  in  conjunction 
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with  a  error  occurrence  in  the  data  stream.  This  way,  a 
single  gap  will  not  be  recorded  as  two  gaps  of  smaller  size. 

The  technique  of  collecting  and  storing  gap  data  rather 
than  raw  error  data  represents  a  compromise.  The  most 
comprehensive  data  would  be  obtained  by  storing  the  string 
of  error  bits,  exactly  as  they  occur  over  time.  The  problem 
with  attempting  to  store  the  error  information  in  this 
format  is  that  the  storage  requirements  would  be  enormous. 
The  most  practical  method  for  storing  the  error  data  is  to 
collect  and  store  only  the  error  gap  data.  This  approach 
permits  a  relative  simple  program  for  keeping  track  of  the 
gap  counts,  and  also  requires  minimal  storage  for  the  data. 
Since  not  all  gap  sizes  will  occur  in  any  particular 
experiment,  a  linked  list  is  the  most  appropriate  data 
structure  for  storing  the  gap  data.  The  software  for  this 
system  is  currently  under  development.  A  block  diagram  for 
the  full  interface  circuit  is  given  in  Figure  4. 

IV.  CONCLUSIONS: 

The  only  real  long  term  solution  to  the  problem  of 
providing  Markov  models  for  the  EIUs  is  to  collect  the  data 
and  then  convert  the  data  to  Markov  models.  The  experiment 
designed  for  this  purpose  during  this  project  represents  an 
excellent  beginning  on  this  task.  The  hardware  is  currently 
complete,  and  the  software  under  development  should  be 
capable  of  providing  enough  data  to  meet  the  needs  of  the 
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Figure  4:  Interface  Circuit 

robust  protocol  project.  This  project  should  be  continued, 
and  data  should  be  collected  on  a  variety  of  data 
communications  links  under  many  different  circumstances. 

The  current  interface  should  probably  be  duplicated 
and/or  improved  as  needed  to  permit  the  simultaneous 
collection  of  data  on  full  duplex  communications  links. 
Additional  units  may  be  needed  to  permit  simultaneous  data 
collection  on  various  media  such  as  HF,  VHF,  UHF,  etc.  One 
desirable  future  goal  would  be  to  establish  a  HF  data  link 
between  Mississippi  State  University  and  Rome  Laboratory  for 
the  continued  collection  of  data  and  for  the  extension  of 
interaction  between  these  two  organizations. 
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Research  into  the  use  of  heuristic  models  should 
continue.  Currently,  these  models  require  more  states  than 
are  available  in  the  EIU.  One  possible  future  experiment 
would  be  to  build  computer  simulation  models  of  the 
heuristic  models  and  use  these  programs  to  generate  gap 
data.  This  gap  data  could  then  be  converted  to  Markov 
models  using  the  currently  available  models.  The 
practicality  and  value  of  such  models  is  worthy  of  further 
study.  Such  models  may  provide  some  insight  into  the  nature 
of  the  error  processes. 

Some  additional  research  should  be  devoted  to  a  study 
of  the  potential  for  developing  certain  analytical  models  of 
data  communications  channels  to  determine  if  such  models 
could  be  adapted  for  use  in  the  EIU.  Examples  might 
included  meteor  burst  and  troposcatter  channels. 
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APPROXIMATING  NEURAL  NETS  WITH  NEURAL  NETS 
Michael  D.  Taylor,  Associate  Professor  of  Mathematics 

Abstract 

Given  a  nenral  net  whose  architecture  is  defined  ny  continuous  functions,  a  method  is 
exhibited  for  constructing  a  second  neural  net  whose  behavior  approximates  that  of  the  first  one 
arbitrarily  closely  and  whose  architecture  is  defined  by  continuously  differentiable  functions. 
This  provides  a  means  of  “training”  the  first  network  by  error  back-propagation  even  in  instances 
where  back-propagation  is  not  directly  applicable.  This  in  turn  gives  a  tool  for  studying  neural 
nets  with  “nonstandard”  architectures. 

1.  latroductioa 

Neural  nets  have  emerged  as  a  possible  alternative  or  complement  to  standard  computer 
technology.  They  are  believed  to  have  a  potential  for  performing  such  tasks  as  image  processing 
and  pattern  recognition  in  a  swifter  and  more  satisfactory  fashion  than  standard  computers.  One 
very  important  characteristic  of  neural  nets  and  one  reason  for  hoping  they  can  perform  some 
otherwise  difficult  tasks  is  that  neural  nets  have  the  potential  to  be  trained  rather  than  having  to 
be  programmed. 

The  most  widely  used  method  of  training  neural  nets  is  the  error  back-propagation 
algorithm.  There  is  a  certain  technical  restriction  on  this  method;  The  algorithm  requires  that 
the  ways  in  which  units  of  a  neural  net  depend  on  one  another  all  be  describable  in  terms  of  what 
are  called  continuously  differentiable  functions  (also  known  as  functions).  Not  all  nets  n:<‘''t 
this  requirement,  so  back-propagation  cannot  be  applied  to  them. 

A  particular  motivation  for  this  research  was  interest  in  neural  nets  having  an 
architecture  suggested  by  concepts  occuring  in  fuzzy  analysis.  Fuzzy  analysis  is  a  method  of 
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dealing  with  uncertainty,  particularly  uncertainty  arising  from  vagueness  rather  than  randomness. 
(Example:  When  is  a  pear  ripe?  There  is  more  than  one  possible  answer.)  There  has  recently 
been  a  great  deal  of  interest  in  using  fuzzy  logic  m  control  problems;  for  example,  as  described  iu 
[3],  an  automatic  tram  operating  system  has  been  successfully  tested  in  Japan.  Fuzzy  analysis 
has  also  been  turned  to  the  problem  of  pattern  recognition  (see  [2])  with  possible  applications  to 
military  intelligence  and  medical  data.  It  should  also  be  noted  that  there  is  recognition  m  the 
neural  network  community  of  the  desirability  of  studying  neural  networks  with  nonstandard 
architectures.  This  point  is  specifically  brought  out  in  the  DARPA  study  on  neural  networks,  [1], 
Another  example  occurs  in  [5]  where  nonstandard  architectures  of  interest  in  image  processing  are 
mentioned  as  worthy  of  further  investigation;  this  last  example  makes  use  of  operations  typical  of 
fuzzy  analysis. 

Very  little  seems  known  about  neural  net  architectures  which  derive  from  fuzzy  analysis 
concepts.  (But  an  example  of  such  an  architecture  can  be  seen  in  [4].)  One  difficulty  in 
investigating  such  aichitectures  is  that  the  functions  one  typically  uses  in  fuzzy  analysis 
(maximum  and  minimum)  are  nondifferentiable.  This  means  the  back-propagation  algorithm,  our 
most  widely  used  tool  in  training  a  neural  net,  would  not  be  applicable. 

The  objective  of  this  work  has  therefore  been  to  find  a  method  -  comparable  to  or 
analogous  to  the  back-propagation  algorithm -- of  training  “fuzzy”  neural  nets.  The  outcome  has 
been  better  than  that.  A  way  has  been  found  to  “extend”  back-propagation  to  any  neural 
network  whose  workings  can  be  completely  described  in  terms  of  continuous  functions.  This 
includes  not  only  the  “fuzzy"  nets  but  presumably  a  large  number  of  other  types  as  well  whose 
architectures  have  not  even  as  yet  been  described. 

2.  The  Problem 

The  back-propagation  algorithm  has  been  described  many  times  in  the  literature  on  neural 
nets.  (See,  for  example,  [6]  or  [?}.)  This  description  has  usually  been  given  with  certain 
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restrictions  in  mind,  namely  that  compntations  of  cell  states  will  in^^olve  mnltiplying  connection 
weights  by  other  cell  states,  adding  a  number  of  results  of  this  kind  together,  then  adding  (or 
subtracting)  a  “threshold”  value,  and  finally  applying  ^  so-called  “squashing  function”  to  the 
result.  These  particular  details  do  not  matter.  What  does  matter  is  that  a  very  particular 
architecture  is  being  assumed.  In  this  work  we  still  restrict  ourselves  to  layered,  feedforward 
networks,  but  for  our  purposes  we  need  -  and  give  -  a  description  of  the  back-propagation 
algorithm  which  applies  to  more  general  architectures  than  those  usually  encountered  in  the 
literature. 

We  then  come  to  the  basic  idea  of  this  work.  Suppose  we  are  given  a  neural  net  whose 
workings  are  described  by  continuous  functions.  We  construct  a  second  neural  net  whose 
behavior  approximates  that  of  the  first  net  very  closely  -  as  closely  as  we  may  wish  -  but  whose 
workings  are  described  by  (that  is,  continuously  differentiable)  functions.  Let  us  call  this 
secord  network  a  network.  The  fact  that  the  approximating  network  can  be  constructed  is 
a  straightforward  application  of  some  well-known  facts  from  mathematical  analysis.  A  proof  of 
the  construction  and  a  particular  way  to  carry  it  out  are  given.  The  back-propagation  algorithm 
can  now  be  applied  to  tram  the  network.  The  results  of  this  training  -  connection  weights  and 
threshold  values  of  cells  -  can  then  be  transferred  to  the  original  network.  Since  the  two 
networks  behave  in  very  similar  ways,  the  original  network  then  behaves  as  though  trained. 

We  then  exhibit  certain  technical  details  which  are  useful  in  actually  carrying  out  this 
program  and  conclude  with  some  analysis  of  certain  architectures  to  which  back-propagation 
would  net  normally  be  applicable. 

3.  Error  Back-propagatioB 

It  is  convenient  for  our  purposes  to  first  describe  a  generalized  form  of  error  back- 
propagation  for  a  synchronous,  feed-forward,  layered  ;.eural  net.  The  discussion  given  below 
actually  applies  to  slightly  more  general  networks.  We  assume  we  have  units  (or  neurons  or  cells) 
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Let 

X  it  a 

$  =  the  set  of  i  snch  that  is  an  inpnt  unit, 

Ct  =  the  set  of  i  such  that  is  an  output  unit, 

<$.  =  the  set  of  j  such  that  there  is  a  connection  running  from  u.  to  u-, 

1  )  1 

-  the  state  of  the  ith  unit, 

=  the  weight  of  the  connection  running  from  the  ;th  to  the  ith  unit, 

=  the  “bias”  or  “threshold”  value  of  the  ith  unit. 

We  think  of  ?.  as  the  set  of  "predecessors”  or  “parents”  of  the  ith  unit,  and  we  put  liias  and 
threshold  in  quotes  in  the  definition  of  because  we  may  use  it  in  ways  which  make  it 
inappropriate  to  picture  it  as  playing  that  role. 


It  is  usual  to  assume  the  states  of  the  units  are  related  by  an  equation  of  the  form 


where  the  s-’s  are  understood  the  be  the  states  of  the  u^’s  at  some  time  t,  where  is  understood 
to  be  the  state  of  at  time  t  +  1,  and  where  is  a  nonlinear  function  often  referred  to  as  a 
“squashing”  function.  However  we  shall  assume  the  more  general  relation 

'i  =  ^i  (  e  '  %'i  €9.’ 

where  as  before  the  s^’s  are  understood  to  be  states  at  time  t  and  is  a  state  at  time  <  + 1.  In 

this  last  equation  <S;};  c  »  aod  to  be  thought  of  as  vectors  whose  components 

;  ;  e  i; ;  e 

are  indexed  by  .  The  s^’s  and  and  play  the  role  of  independent  variables  in  the 
function  Sj,  and  it  is  very  important  to  note  that  is  assumed  to  be  a  continuously  differentiable 
function  of  these  variables. 


Let  E  be  an  index  of  performance  used  in  training  the  neural  net.  We  assume  that 

(1)  E  is  to  be  minimized, 

(2)  £  is  a  continuously  differentiable  function  of  the  output  states,  £  =  £  ^  {s^}.  g  ^  ]  • 
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E  might  be  the  sum  of  squares  of  training  errors  or  some  sort  of  entropy  or  some  other 
sort  of  index.  Error  back-propagation  gives  us  a  method  of  incrementally  changing  the  li/^^’s  and 
$j’s  so  as  to  produce  corresponding,  incremental  decreases  in  E  which  will  eventually  (hopefully) 
force  £  to  a  minimum  value  or  at  least  close  to  such  a  minimum. 

We  first  take  a  fixed  input  vector  {s.}^  ^  ^  .  Since  the  output  states  ^  ^  are  now 
completely  determined  by  the  weights  and  biases  of  the  neural  net,  we  may  view  £  as  a  function 
of  the  weights  and  biases, 


for  this  fixe.’  input.  Then  the  following  equation  essentially  describes  the  first  Taylor  polynomial 


for  E; 


+  Ef  <V 


i  1 

We  wish  to  change  the  w^^’s  and  ^^’s  in  such  a  way  as  to  carry  £  in  the  direction  of  steepest 
decrease.  Accordingly  we  set 


%  =  -'ll 

where  n  is  a  small,  positive  number.  It  follows  that  we  must  be  able  to  find  some  way  to  calculate 
these  partials  of  £  in  terms  of  the  fixed  input  vector. 


Now  choose  i  and  j.  It  follows  from  (2)  that  we  must  have 


JE_ 


_  Y'  i£ 


and 


—  -  a£ 


38 


keo 


Note  that  the  partials  of  £  with  respect  s^,  (where  is  an  output  unit)  are  all  computable  since  £ 


as  a  function  of  the  Sj^'s  is  given.  Next  it  follows  trum 


is„> 


rq  € 
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that  for  every  which  is  not  an  input  unit,  we  have 


ds 

Sw. 


P  _ 


9s, 


dm 


if  p  =  i. 


Pi 


and 


3s _  3s. 


=  3s.  dWii 

?  €  ?  }J 


if  ?  i, 


3s  ^^P  . 

=  m; 

r-'  ^^P  ^ 


Of  course  if  s^  is  an  input  state,  these  partials  are  zero. 


It  follows  from  the  last  two  equations  that  beginning  with  the  input  states  oie  may 
recursively  calculate  every 


3s 


% 


^  and  ^ 


33^ 


and  hence 


3£  3£ 

3;;;;  ■"-  p; 


4.  Some  Approxifflation  Results 

We  consider  layered,  feed-forward  neural  nets.  We  wish  to  approximate  the  functions 
which  transform  the  contents  of  the  nodes  on  one  layer  of  the  network  to  a  new  set  of  contents  on 
the  next  layer  by  transformations.  This  defines  a  new  net,  one  to  which  it  is  possible  to  apply 
the  error  back-propagation  algorithm.  The  important  thing  is  to  show  that  it  is  possible  to 
construct  the  new  net  in  such  a  way  as  to  be  sure  its  behavior  approximates  that  of  the  old  net  in 
some  satisfactory  fashion.  The  epsilon-delta  conditions  given  below  are  typically  associated  with 
uniformly  continuous  functions  or  continuous  functions  defined  over  compact  sets. 
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We  state  the  propositions  below  without  proofs. 


Our  first  result  shows  we  can  approximate  the  transformation  from  one  layer  of  a 
network  to  a  single  unit  in  the  next  layer.  Note  that  if  u  =  a  point  in  R^,  then 


the  standard  Euclidean  norm.  We  will  also  find  it  convenient  to  employ  the  norm  given  by 

||«ll  =  max|U;|. 

1  ‘ 

PROPOSITION  1.  Let  F:  R^^R  be  a  locally  integrable  function  such  that  1  F(x)  |  ^  M 
for  all  X  in  the  domain  of  F.  Suppose  (  and  S  are  positive  numbers  such  that 
I  P(Xj^)  -  ftXj)  I  ^  f  whenever  ||Xj^  “  11  ^  ^  be  a  real-valued  function  such  that 

I  is  C^, 

4  IQ, 

and 

II  u  -  V II  ^  Hor  all  «,  v  €  supp  ^ 

where  supp  #  stands  for  the  support  of  Let  G  =  F*^.  Then  G  satisfies  the  following: 

(1)  G  is  oV 

(2)  I  G(x)  I  i  M  for  all  x. 

(3)  !  C?(x)  -  P(x)  1  ^  €  for  all  x. 

(4)  I  (?(Xj)  -  CJ(X2)  I  ^  €  whenever  ||  x^  “  11  ^ 

NOTE  1:  The  boundedness  of  F  is  used  here  only  to  deduce  the  boundedness  of  G  by  the 
same  bound.  If  the  statements  about  boundedness  were  omitted,  the  proposition  would  still  be 
true.  There  are  two  justifications  for  including  such  a  condition.  On  the  one  hand  it  is 
physically  reasonable;  in  a  real  world  neural  net,  the  values  assigned  to  a  unit  cannot  become 
infinitely  positive  or  negative.  On  the  other  hand,  if  we  consider  neural  nets  with  an  architecture 
inspired  by  fuzzy  analysis,  we  may  want  our  units  to  take  on  values  between  0  and  1,  and  it  may 
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be  important  to  know  that  if  a  net  described  by  F  has  this  property,  then  the  “approximating” 
net  described  by  G  also  has  this  property. 

Our  next  proposition  shows  how  to  extend  the  approximation  process  to  the 
transformation  which  takes  one  layer  of  the  network  to  the  next  layer.  D  may  be  thought  of  as 
the  number  of  units  and  connections  and  biases  in  one  layer  and  R  as  the  number  of  units  in  the 
succeeding  layer. 

PROPOSITION  2.  Suppose  F:Vp->Xt^.  We  may  write  F  in  the  form  (F^tF^f-^F 
Assume  that  each  F-  is  locally  integrable  and  that  for  all  i  and  all  x  we  have  I  F-(x)  |  ^  M. 
Suppose  f  and  S  are  positive  numbers  such  that  |j  -  f (012)  j|  ^  f  whenever  ||  -  ®2 1|  ^ 

^1’  ^2’  ■  ■  ■’  ^R'  satisfy 

each  is  C^, 

each  2  0, 

_  ^ =  1  for  each  i,  and 

J  ’ 

for  all  1  we  have  ||  u  -  y  II  ^  J  for  all  u,  y  €  supp  #  • . 

For  each  i,  set  G^  =  F^*^-  and  let  G  =  We  then  have  the  following 

conclusions: 

(5)  G  is 

(6)  I  Gj(x)  I  ^  W  for  all  1  and  x. 

(7)  II  G(x)  -  F(x)  II  <;  €  and  |  ^(x)  -  F(x)  I  <[  f  for  all  x. 

(8)  ||ff(Xj)- (3(x2)||^  f  and  |  (?(Xj^)  -  G(x2)  I  ^ -JR  «  whenever  ]|  x^  -  X2 1|  ^  J. 

NOTE  2:  If  /f  is  a  function  of  F  variables,  we  say  the  ith  variable  is  active  provided  we 
can  find  u  and  y,  differing  only  in  the  ith  variable,  such  that  H(u)  H(v).  In  constructing  the 

convolutions  note  that  may  be  taken  to  be  a  function  of  only  the  active  variables  of 

and  that  the  evaluation  of  (F^*^-)(x)  may  possibly  be  obtained  by  integrating  over  a  lower 

dimensional  space  than  R^.  This  can  be  a  useful  fact  if  the  number  of  nodes  in  the  neural  net  is 
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large  because  D  is  an  upper  bound  for  the  number  of  nodes  in  a  certain  layer.  If  F.  has  112  active 
variables,  then  instead  of  manufacturing  ,  we  can  simply  set  =  F^  .  Then  (5)  follows 
because  F-  is  a  constant  function,  and  (6),  (7),  and  (8)  are  true  trivially. 

The  following  result  should  be  thought  of  as  applying  to  neural  nets  with  L  layers.  Each 
Fj  can  be  considered  to  be  the  function  which  takes  the  node  values,  the  connection  weights,  and 
the  biases  associated  with  one  layer  and  transforms  them  into  the  node  values  associated  with  the 
next  layer. 

PROPOSITION  3.  Let  F^,  F^  map  thus: 


Assume  each  component  of  each  F^  is  locally  integrable  and  that  there  is  a  given  sequence  of 
positive  numbers  satisfying 

||F/Xi)-F/x2)11^4',  whenever  pj  -  *211^  ^  ~  1  • 

For  each  F.  manufacture  a  G-  m  the  manner  described  in  Proposition  2  with  i  i ^  and  e  s  f  • 
1  Da 

.  Then  each  G^  is  and  for  all  x  €  R  ''we  have 

NOTE  3:  One  may  give  another  version  of  this  proposition  in  which  the  condition 

'^‘^enever  pi -*211^  ^-1 

is  replaced  by 

I  Fj(x^)  -  Fp2)  I  ^  -f=  whenever  1  -  Xj  I  ^  j 

N  t 

and  the  conclusion  is 

l(G^o(3^_^o...o(J^)(i)  -  (F^oF^_j0...oF^)(x)!  £  LS^. 

NOTE  4.  One  may,  if  one  wishes,  think  of  the  last  layer  of  the  network  as  corresponding 
to  a  performance  index  F.  These  propositions  then  indicate  how  to  construct  a  network  in 
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such  a  way  ^Lat  its  oer'^orman-e  index  will  approximate  that  of  the  original  network  as  closely  as 
one  desires. 

5.  Construction  and  Training  of  Networks 

Let  us  now  consider  some  of  the  details  involved  in  carrying  out  the  program  described  in 
the  last  two  sections. 

Suppose  that  we  have  found  a  sequence  ig,  of  the  sort  required  by  Proposition 

3.  Let  K-  be  the  higher  dimensional  “cube” 

where  A.  is  the  number  of  active  variables  of  the  state  s-  of  the  ith  unit  of  the  network.  We 
choose  ^.;R  ‘-fR  such  that 
is  CK 

and  supp  C  if . . 

Notice  that  this  last  condition  implies  llu- viK  for  all  €  supp 

We  now  introduce  functions  which  define  the  values  of  nodes  in  the  network.  Set 
Sj  =  if  Uj  IS  an  input  node 

5  ■♦f I  if  Uj  is  flfli  an  input  node. 

(Note;  If  we  take  E  to  be  our  output,  then  we  may  want  to  compute  a  new  performance  index  of 
the  form  E  -  Eti.  This  E  may  be  taken  as  the  performance  index  of  the  network.) 

Let  u  and  y  stand  for  vectors  at  which  can  be  evaluated,  i.  e.,  they  are  shorthand 
symbols  for  vectors  of  the  form 

We  now  begin  the  training  of  the  network. 
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step  L.  Starting  with  the  inputs  is^}^  ^  ^  ^  ^  and  given  connection  weights  <w.^} 

and  thresholds  iS^},  compute  the  outputs  of  the  neural  net,  {?^.}^  ^  ^  and  the  vectors  u  at 

which  each  ?.  is  evaluated. 

Step  2^  Check  the  performance  index  to  see  if  the  network  is  trained.  Typically  this  may 
amount  to  seeing  if  £  ^  fQ  (or  E  ^  fg)  for  some  given  fg.  (It  may  be  necessary  to  check  the 
performance  index  for  a  number  of  input  vectors,  not  just  for  one.)  If  the  network  is  trained, 
then  stop  and  transfer  the  connection  weights  and  threshold  values  from  the  network  to 
the  original  network.  By  Proposition  3  the  outputs  of  the  original  network  will  now  differ  from 
those  of  the  network  by  at  most  f  =  iS^.  If  the  network  is  not  trained,  then  go  to  the 
next  step. 


Step  3^  Use  the  computed  outputs  to  evaluate  g  ^  (or  g  ^). 

Step  If  u.  belongs  to  layer  0  (the  inputs),  take  all  partials  of  s-  to  be  0  for  all  i. 

Step  L  Consider  the  highest  layer  k  of  the  network  with  the  property  that  for  all 


units  in  a  lower  layer  all  partials  of  the  form 

9?;  95, 


5^' 


have  been  computed  (where  lies  in  a  lower  layer  than  that  containing  and  lies  in  a 
layer  no  lower  than  that  containing  «^).  Then  for  all  u-  in  layer  k,  we  compute 


^95A  r  /9^-\ 

9rr=  J 

\ \ }/ 


0  if  belongs  to  layer  fc  - 1  and  ;  g  9^ 


E 

m£9 


•5=4-  if  u.  belongs  to  a  layer  lower  than  k-l. 
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J!i 

rw 


r  {^^i\ 

•l(ti)=  hi — ^ku  — V)i»  if  171  = 

'/  K.  ^ 


1  and  ;  G  '? . 


0  if  m  =  i  and  i  2 


rJ 

if  belongs  to  a  lower  level  than  k. 


(9s  A  r  (di\ 

brr=  J  ‘f'=’ 

\  }/  V  \  }j 


0  if  belongs  to  layer  k  and  ;  ^  i 

y'  I 1  f-^l  if  u.  belongs  to  a  layer  below  k. 

After  having  carried  out  these  computations,  if  k  <  L,  then  repeat  this  step.  If  1;  =  L,  then  go  to 
the  next  step. 


Step  £.  For  all  i  and  j  now  compute 


Step  L  For  all  i  and  ;  replace 


u/..  by  — 

*1;  "i;  dw-. 


and 


h  by  $-1) 


dE 


1  '  1  '  d$. 
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where  n  is  some  appropriately  chosen  small,  positive  number. 


Step  L  Return  to  step  1. 

6.  ConstnictiB9  Deltas  from  Epsilons 

Propositions  1  and  2  refer  to  (  and  S  which  are  related  to  one  another  in  a  certain  way, 
and  Proposition  3  refers  to  a  sequence  fg,  ij,---,  with  the  property  that  each  member  of  the 
sequence  is  related  to  the  one  before  and  after  it  in  a  certain  way.  We  give  here,  without  proof,  a 
technical  result  which  is  sometimes  helpful  in  constructing  such  pairs  (  and  $  and  such  sequences 

Jg. 

First  if  F:  ff-iR”,  where  K  C  R^,  recall  that  the  ith  variable  of  F  is  active  if  and  only  if 
there  exist  u  and  v  in  K  which  differ  only  in  their  ith  components  and  have  the  property  that 
P(u)  ^  F(v). 

PROPOSITION  4.  Let  Jf  be  a  compact,  convex  subset  of  R”*.  Suppose  . 

are  maps  and  G:  ff-iR”  is  a  continuous  map  with  the  property  that  for  all  i  €  there  is  an  i 
such  that  cJ(x)  =  G^(x).  Suppose  further  that  each  G^  is  a  function  of  at  most  A  active  variables. 
Then  for  all  u,  v  €  we  have 

||G(u)-<3(»)ll  ^  ^  W|iu-i/|l 

where  each 


M  =  max 


X  e  ff,  1  i  ^  p,  1  ^  ^  n,  and  1  ^  j  ^  m 


7.  Some  Case  Analysis;  Epsilons  and  Deltas 

The  principal  difficulty  in  implementing  the  approximation  technique  for  neural  nets 
lies  in  computing  the  epsilons  and  deltas,  in  particular  the  deltas  of  Proposition  3.  Here  we 
analyze  how  this  could  be  done  for  a  few  particular  cases. 
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Casp  We  consider  a  neural  net  with  an  architecture  given  by 

where  fj.  is  the  threshold  function, 

=  0  if  s  <  0 

X  if  0  ^  X  ^  1 
1  if  1  <  X. 

Notice  that  fj.  ,  though  continuous,  fails  to  be  differentiable  at  0  and  1  so  that  back-propagation 
IS  not  directly  applicable  to  this  neural  net. 

Given  e  >  0,  we  wish  to  find  J  >  0  such  that  ilu  -  v il  ^  f  implies  I  Sj(u)  -  s ■(«)  I  ^  e. 

We  see  that  is  a  continuous  function  which,  depending  on  the  values  of  its  variables,  is 
equal  to  one  of  the  functions 

Thus  Proposition  4  is  applicable. 

Let  B  be  a  positive  number  which  1  1 1  I  1 ,  and  I  |  are  “unlikely”  to  ever 

exceed  for  any  possible  m  or  j.  It  will  be  convenient  to  suppose  S  ^  1.  We  restrict  our  attention 
to  u  and  v  satisfying  Hull  and  II vll  ^  B.  We  compute  the  M  of  Proposition  4: 


max 

i, 


3.. 

9s. 

3., 

I 

9s. 

t 

) 

I 

max  4  0, 1  ,  I  5. 1  ,U  =  B. 

1, ;,  m  t  f  ) 


Let  A  be  the  maximum  number  of  active  variables  any  s-  can  have.  Then  by  Proposition  4  we 
have 


Thus 


I  Sj(u)  -  ^j(w)  I  i  ^F||u-v||. 


II  u  -  « II  ^  (lAB  implies  I  s^tu)  -  s-(v)  I  ^  f . 
Notice  that  this  last  inequality  is  independent  of  the  choice  of  i. 
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Choose  f  >  0  and  let  L  be  the  number  of  layers  of  the  network.  Referring  to  Propositions 


1  through  3,  we  see  that  we  can  set 
=  f/i, 

_  1  =  iJAB  =  dJ.AB, 

etc. 

In  general  we  have 


These  are  the  values  needed  to  ensure  that  the  outputs  of  the  network  will  differ  from  the 
corresponding  outputs  of  the  original  network  by  at  most  f. 

Case  2*  We  assume  we  have  a  network  with  architecture  given  by 


where  V  stands  for  “max”  and  A  stands  for  “min”  and  0  <,  1  for  all  m  and  n. 

This  IS  the  sort  of  architecture  investigated  in  (4). 

The  last  case  shows  us  the  pattern  to  be  followed. 

Note  that  for  any  particular  choice  of  variables  we  must  have  s-  =  or  or  .  Thus 
the  partials  of  5.  turn  out  to  be  0  or  1.  It  follows  that  the  W  of  Proposition  4  is  1.  As  usual,  let 
A  =  the  maximum  number  of  active  variables  for  any  unit.  Therefore 

II «  -  II  ^  €lA  implies  I  Sj(u)  -  s  j(«/)  I  ^  f  . 

Note  that  this  last  inequality  is  independent  of  i. 

Let  e  >  0  be  given  and  L  be  the  number  of  layers  of  the  network.  Consulting  Propositions 
1  through  3  again,  we  see  that  we  want 

h  = 

-  1  = 
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etc., 

or,  in  general, 


_ _ e 


Case  1.  Let  us  assume  an  architecture  of  the  form 


where 


/(s)  = 


l  +  e-«» 

for  some  of  >  0.  Architectures  of  this  sort  are  mentioned  in  [5]  as  being  of  interest  in  image 
processing. 

It  is  straightforward  to  show  -  and  turns  out  to  be  useful  to  know  -  that  the  maximum 
value  of  f'(x)  is  ali.  Notice  that  for  particular  cLuices  of  the  variables  we  must  have  = 
^ j)  for  some ;  €  . .  The  derivatives  of  these  functions  must  have  the  form 

f\^s^  +  S.)  s^, 

or 

0. 

Supposing  I  $ ^  I ,  I  w-  I ,  and  I  I  and  B  tU  then  the  magnitudes  of  these  derivatives  will 
all  be  bounded  by  aBli.  Thus  we  may  take  M  =  aF/4  when  applying  Proposition  4. 


Assuming  f  >  0  given,  we  then  see  we  should  take 
=  €/i, 

h-1- 

h-2^  (4^f)/(i(a^S)2). 

etc., 
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(where  A  is  of  coarse  an  upper  bound  on  the  number  of  active  variables  per  node)  or,  in  general, 

^  L(aAB)^  “  ^ 

Case  4*  Consider  the  architecture  defined  by 


where  /j,  is  the  thieshcld  function  from  Case  1.  This  is  again  an  architecture  of  the  sort 
described  in  [5]  as  being  of  interest  in  image  processing. 

For  particular  cnous.*,  of  the  variables  we  must  have 
Sj  =  0  or 

+  ^  ■  for  some  ;  or 

1. 

The  derivatives  of  these  functions  are  0  or  1.  The  analysis  for  this  case  reduces  to  that  for  Case 
2,  and  we  obtain 


8.  Conclusions 

We  have  shown  how,  m  theory,  one  can  approximate  a  neural  net  whose  architecture  is 
defined  by  continuous  functions  by  a  neural  net  whose  architecture  is  defined  by  functions. 
This  leads  to  a  technique  for  indirectly  applying  the  error  back-propagation  algorithm  to  the  first 
neural  net.  In  addition  a  slightly  generalized  version  of  the  back-propagation  algorithm  is 
exhibited.  Therefore  a  large  number  of  neural  net  architectures  are  opened  for  investigation  using 
the  back-propagation  algorithm  which  would  not  normally  be  accessible. 

Several  further  lines  of  investigation  are  suggested  by  these  results: 

(1)  Software  should  be  developed  to  test  the  applicability  and  efficacy  of  the  techniques 
put  forth  here.  Can  networks  be  trained  this  way  to  cope  with  such  standard  tasks  as  the  XOP. 


problem  or  character  recognition? 


(2)  Can  these  approximation  techniques  be  extended  to  networks  whose  architectures  are 
defined  by  discontinuous  functions?  For  example,  the  signum  function  is  used  in  a  number  of 
architectures.  The  norm  used  to  measure  approximations  in  this  present  work  is  essentially  the 

norm.  One  suspects  that  such  an  extension  would  require  a  different  sort  of  norm. 

(3)  These  techniques  could  be  used  to  carry  out  an  investigation  of  neural  nets  with 
“fuzzy”  architectures  (that  is,  architectures  suggested  by  the  concepts  of  fuzzy  analysis).  Again  it 
would  be  reasonable  to  ask  if  such  networks  can  be  trained  to  handle  such  standard  tasks  as  the 
XOR  problem  or  character  recognition. 

(4)  One  of  the  difficulties  in  applying  back-propagation  is  the  occurrence  of  local  minima 
which  produce  false  solutions  of  the  training  problem.  It  is  possible  that  the  construction  of 
neural  nets  to  approximate  given  neural  nets  might  be  used  as  a  technique  to  get  rid  of  such  local 
minima. 

(5)  The  version  of  the  back-propagation  algorithm  presented  here,  combined  with  the  idea 
of  approximations,  might  make  it  easier  to  investigate  performance  indices  S  other  than  the 
sum  of  the  squares  of  the  errors.  For  example,  fuzzy  analysis  suggests  the  possibility  of 
performance  indices  which  are  linked  to  the  notion  of  inference  or  implication  and  which  are 
constructed  using  nondifferentiable  functions. 
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This  document  provides  an  outline  for  implementing  a  successful  hierarchical,  integrated  sim¬ 
ulation  model  at  Rome  Laboratories  for  the  purpose  of  modeling  tactical  battle  engagement 
scenarios.  It  is  recommended  that  a  analytical  framework  be  used  to  guide  the  development 
of  the  integration  process.  This  analytical  framework  focuses  on:  (a)  the  transfer  of  data 
elements  between  model  modules,  (b)  the  estimation  of  response  population  characteristics 
of  interest,  (c)  the  design  of  simulation  experiments  in  order  to  estimate  a  regression  model  of 
interest,  and  (d)  the  application  of  variance  reduction  techniques  throughout  the  integrated 
model.  The  result  of  fully  integrating  the  existing  simulation  models  would  be  the  ability 
to  accurately  and  validly  model  tactical  battle  engagement  situations  with  unprecedented, 
varying  levels  of  detail  that  could  be  selected  according  to  the  modeler’s  agenda.  Thus,  the 
fully  integrated  model  would  serve  as  a  valuable  test  bed  for  evaluating:  (a)  existing  and 
proposed  hardware  systems,  (b)  existing  and  proposed  simulation  models,  and  (c)  battle 
engagement  strategies. 


Key  Words:  simulation,  hyerarchical  model  integration,  control  variates,  linear  models,  sim¬ 
ulation  experiments,  variance  reduction  techniques. 
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1.  introduction 


In  this  report  we  briefly  outline  a  methodology  for  fully  integrating,  in  a  hierarchical  fashion, 
existing  simulation  models  currently  used  at  Rome  Laboratories  for  modeling  aspects  of 
tactical  battle  engagement  scenarios  (although  the  basic  elements  of  this  methodolgy  could 
be  effectively  used  to  integrate  different  levels  of  simulation  models  used  in  other  modeling 
arenas  that  are  of  interest  to  the  Air  Force,  such  as  communications  network  models,  materiel 
and  personnel  traffic  flow  models,  etc.).  The  result  of  fully  integrating  the  existing  simulation 
models  would  be  the  ability  to  accurately  and  validly  model  tactical  battle  engagement 
situations  with  unprecedented,  varying  levels  of  detail  that  could  be  selected  according  to 
the  modeler’s  agenda.  Thus,  the  fully  integrated  model  would  serve  as  a  valuable  test 
bed  for  evaluating:  (a)  existing  and  proposed  hardware  systems,  (b)  existing  and  proposed 
simulation  models,  and  (c)  battle  engagement  strategies. 

The  tactical  battle  engagement  arena  is  recommended  as  a  test  bed  for  formulating  and 
implementing  the  simulation  model  integration  methodology  for  the  following  reasons:  (a) 
simulation  models  within  this  arena  (as  opposed  to  communications  network  models,  etc.) 
are  of  paramount  interest  to  the  Air  Force,  (b)  highly  sophisticated  simulation  models  of 
many  aspects  of  the  tactical  battle  engagement  environment  are  currently  being  used  at  Rome 
Laboratories  (many  of  these  simulation  models  were  developed  by  government  contractors 
with  the  direction  and  cooperation  of  Rome  Laboratories  personnel),  and  (c)  initial  ground 
work  for  implementing  the  integration  methodology  has  already  begun  through  the  efforts 
of  the  Joint  Simulation  Panel  (JPL)  which  was  established  in  1988  at  Rome  Laboratories 
primarily  through  the  efforts  of  Joe  Cruskie  (COAA),  Neal  Marples  (COAA),  Jim  Papagni 
(IRR.A.).  and  Alex  Sisti  Jr.  (IRAE)  and  the  recommendation  of  IR  and  CO.  Currently,  JPL 
consists  of  Jerry  Dussault  (COAA),  Jim  Papagni  (IRRA),  and  Alex  Sisti  Jr.  (IRAE). 

Together,  they  have  begun  to  formulate  a  means  of  directing  and  managing  a  truly  inte¬ 
grated  simulation  effort  at  Rome  Laboratories.  Up  until  this  time,  their  ideas  for  simulation 
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model  integration  have  focused  on:  (a)  personnel  staffing  and  involvement  issues  regarding 
the  simulation  integration  problem,  (b)  evaluation  of  the  candidate  simulation  code  so  as 
to  determine  those  simulation  programs  that  will  participate  (at  least  initially)  in  the  in¬ 
tegration  effort,  (c)  selection  of  a  spt-.ific  system  scenario  to  be  modeled  intially,  and  (d) 
evaluation  of  the  computer  hardware  that  may  serve  as  platforms  for  the  integrated  model. 

However,  through  their  efforts,  they  have  identified  one  key  component  necessary  to 
successfully  developing  an  integrated  simulation  effort  at  Rome  Laboratories  that  is  cur¬ 
rently  missing.  That  component  is  an  analytical  (statistical)  framework  for  directing  the 
transfer  of  data  (data  blocks,  statistical  measures,  etc.)  between  the  model  components  that 
will,  necessarily,  comprise  the  integrated  simulation  model  structure  as  well  as  coordinate 
the  implementation  of  variance  reduction  techniques  and  performance  measure  estimation 
strategies  that  are  critical  to  obtaining  optimal  information  outputs  (especially  for  large-scale 
simulation  models).  This  analytical  framework  is  the  key  component  to  the  entire  simulation 
integration  effort  because  it  will  guide  the  efforts  in  the  other  four  areas  mentioned  above  to 
a  common  goal.  This  concept  is  indicated  in  Figure  1  below. 

In  the  remainder  of  this  document,  we  develop  an  outline  for  structuring  the  analytical 
framework  for  the  simulation  integration  project.  In  the  next  section  we  outline  the  part 
of  the  analytical  framework  that  will  coordinate  the  transfer  of  data  sets  between  simula¬ 
tion  models  within  the  integrated  model.  In  Section  3  we  discuss  how  the  estimation  of 
performance  measure  characteristics  of  interest  should  be  structured  within  the  analytical 
framework,  and  in  Section  4  we  indicate  how  the  implementation  of  variance  reduction 
techniques  should  be  done  within  the  integrated  model.  Together,  these  three  sections  of¬ 
fer  a  comprehensive  structure  for  developing  a  successful!  integrated  simulation  model  with 
unparalled  performance  capabilities  for  analyzing  force  effectiveness. 

2.  Data  Transfer 

Consider  the  hierarchical  simulation  model  configuration  depicted  in  Figure  2.  We  anticipate 
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four  primary  levels  of  model  effort  th^t^  together,  will  comprise  the  fully  integrated  simu¬ 
lation  model.  These  four  levels  are,  from  narrowest  scope  to  broadest  scope:  (a)  Primary 
Component  Models  (Level  1),  (b)  Elementary  Systems  Models  (Level  2),  (c)  Small  Mission 
Models  (Level  3),  and  (d)  Campaign  Models  (Level  4). 

The  Level  1  models  primarily  deal  with  modeling  individual  systems  such  as  jammers, 
sensors,  transmitters,  etc..  They  receive  inputs  directly  from  the  model  analyst,  or  some 
database  indicated  by  the  model  analyst,  and  send  outputs  directly  to  the  Level  2  models. 
They  represent  the  highest  level  of  detail  in  the  integrated  model.  Currently,  a  number  of 
simulation  models  at  this  level  already  exist  at  Rome  Laboratories  and  are  under  frequent 
use. 

Level  2  models  focus  on  modeling  the  interaction  of  a  Level  1  model  with  a  specfic 
platform;  e.g.,  the  modeling  of  an  ESM  suite  installed  on  an  aircraft.  The  effectiveness 
of  the  installed  system  is  typically  evaluated  in  the  context  of  a  one-on-one  or  few-  »n-few 
analysis.  The  inputs  into  models  at  this  level  are  comprised  of  two  types:  (a)  those  received 
as  outputs  from  the  Level  1  models  and  (b)  direct  inputs  from  the  model  analyst.  Models 
at  this  level  retain  all  of  the  detail  of  the  models  at  Level  1  via  the  Level  1  outputs  that  are 
fed  directly  into  them.  However,  they  incorporate  this  detailed  information  into  a  broader 
scope  of  engagement  activities.  Outputs  from  the  Level  2  models  are  fed  directly  into  Level 
3  models.  Currently,  Rome  Laboratories  maintains  an  active  development  of  models  at  this 
level  (e.g..  The  Tac  Brawler  simulation  package  being  used  in  modeling  efforts  in  NCTI). 

Level  3  models  focus  on  modeling  whole  sub-theater  engagements  via  the  information 
fed  into  the  model  as  outputs  from  the  Level  2  models  and  the  model  analyst.  Models  at  this 
level  take  an  even  broader  view  of  an  engagement  scenario  than  Level  2  models,  although  still 
letaining,  in  theory,  all  of  the  detail  contained  at  the  lower  levels.  We  anticipate  that  Level 
3  models  will  be  used  to  ascertain  the  effectiveness  of  tactical  plans  and  decision  making 
schemes  in  a  combat  mission  evironment.  Currently,  Rome  Laboratories  is  beginning  to  use 
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Figure  2:  Hierarchical  Simulation  Model 
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a  Level  3  model  (Tac  Suppressor)  within  NCTI.  Outputs  from  the  Level  3  models  will  be 
fed  into  the  Level  4  model. 

Finally,  the  Level  4  model  represents  the  most  comprehensive  model  of  all.  That  is,  it 
has  the  broadest  scope  in  terms  of  its  model  domain.  This  model  will  encompass  all  rele¬ 
vant  activity  associated  with  operations  in  the  joint  (Air  Force,  Army,  and  Navy)  campaigns 
against  a  full  complement  of  combined  enemy  forces.  The  model  will  be  used  to  test  alter¬ 
native  strategic  plans  as  well  as  high  detail,  lower  level  components  within  a  full  campaign 
scenario.  Inputs  to  the  Level  4  model  will  be  obtained  from  the  Level  3  models  and  the 
model  analyst.  Ouputs  from  this  model  will  go  directly  to  the  model  analyst  and  C^I  for 
evaluation  and  decision  making.  Currently,  Rome  Laboratories  does  not  possess  this  soft¬ 
ware  capability.  However,  the  analytical  framework  that  we  are  proposing  in  this  document 
will  greatly  serve  in  'ts  development. 

Next,  we  discuss,  in  general,  how  data  is  to  be  transfered  between  the  models  copmrising 
the  four  levels.  In  order  to  do  that,  we  first  must  define  some  useful  notation.  Let 

•  x,jk  =  the  kth.  {k  =  1,2, ...,6,j)  analyst  supplied  input  to  the  jth  (;  =  1,2, .,.,ai) 
model  at  the  ith  (i  =  1,2, 3,4)  level. 

•  Wijk  =  the  kth  (A:  =  1,2, . . .  ,c,j)  lower-level  supplied  input  to  the  jth  (j  =  1,2, . . .  ,Ot) 
model  at  the  ith  (i  =  2,3,4)  level. 

•  y,jk  =  the  kth  (A:  =  1, 2, ... ,  d,_,)  model  generated  output  from  the  jth  (j  =  1,2,...,  a,) 
model  at  the  ith  (i  =  1,2, 3, 4)  level. 

•  /,j  =  the  conversion  function  that  operates  at  the  jth  (j  =  1,2, ...  ,e,)  model  of  level 
i  {i  =  1,2, 3, 4). 

The  data  generated  by  the  model  modules  comprising  the  integrated  simulation  model 
will  be  of  three  types:  (a)  time  dependent  information,  (b)  observational  information,  and 
i^c)  graphical  information.  Time  dependent  information  is  dependent  upon  the  length  of  the 
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simulation  run.  Observational  information  does  not  depend  on  the  length  of  the  simulation 
run.  Graphical  information  will  complement  the  first  two  forms  of  data,  but  will  not  add  to 
it.  That  is,  all  relevant  information  will  be  given  in  one  of  the  first  two  types  listed  above 
and  the  graphical  data  will  be  used  to  interface  more  effectively  with  the  model  analyst  and 
other  users. 

The  flow  of  information  (data)  utilizing  these  components  is  depicted  in  Figure  3.  Solid 
lines  with  arrows  indicate  the  direction  of  automatically  fed  data  from  one  model  level  to 
another.  Data  is  passed  along  these  lines  in  standardized  formats  with  prescribed  agendas 
determined  by  the  model  from  which  the  data  originates.  Dashed  lines  with  arrows  indicate 
how  feedback  information  is  routed  back  into  a  model  at  what  is  deemed,  by  the  model 
analyst,  an  appropriate  level.  This  feedback  data  process  is  monitored  and  administered 
by  the  model  analyst  by  means  of  a  decision  process  determined  by  the  model  analyst. 
Currently,  we  do  not  exactly  know  all  of  the  information  that  will  be  made  available  to  the 
model  analyst  during  simulation  runs  and  how  often  this  information  is  updated  (certainly, 
much  of  this  information  will  be  determined  by  the  context  in  which  the  integrated  model  is 
to  be  built).  We  also  do  not  know  at  this  time  all  of  the  elements  of  the  decision  process  used 
by  the  model  analyst  for  interpreting  this  information.  In  fact,  although  it  certainly  would 
be  expected  that  the  actual  analyst  involvement  would  be  accomplished  via  a  computer 
program  interface  with  the  integrated  model,  at  this  point  we  offer  no  exact  prescription 
for  carrying  out  such  a  decision  policy.  However,  we  recommend  that  the  minimum  set  of 
decisions  to  be  made  in  the  feedback  pheise  by  the  model  analyst  consist  of: 

•  Tactical  simulation  run  decisions.  These  include  determining  when  to  start  and  to 
stop  simulation  runs  for  individual  model  modules.  We  anticipate  that  in  a  fully 
integrated  simulation  model  that  many  of  the  individual  model  modules  will  start  and 
stop  at  different  times  during  the  integrated  model  run.  In  large  part,  the  starting  and 
stopping  of  the  individulual  model  modules  will  be  determined  by  conditions  obtained 
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elsewhere  in  the  integrated  model.  The  analyst  will  be  responsible  for  determining 
when  these  conditions  are  met  for  each  model  module  in  the  integrated  model  and 
thus,  implementing  the  implied  action  of  the  affected  model  module. 

•  Updating  of  individual  model  module  estimates.  This  includes  updating  model  es¬ 
timates  of  relevance  generated  elswhere  in  the  integrated  model  and  adjusting  their 
input  to  the  model  module  in  question  in  a  timely  fashion.  The  analyst  is  responsible 
for  coordinating  the  entire  set  of  integrated  model  estimates  made  throughout  the  run 
of  the  simulation.  This  set  includes  estimates  made  by  all  model  modules  at  all  levels. 
That  is,  at  any  point  during  the  simulation  run  the  analyst  is  required  to  provide  the 
most  accurate  and  recent  model  estimates.  This  may  also  include  the  updating  of  any 
metamodel  estimates  of  interest  to  the  model  analyst  (see  Section  3). 

•  Maintaining  integrated  model  validity.  The  model  analyst  must  maintain  vaidity  of 
the  overall  integrated  simulation  model.  This  is  to  be  done  on  a  continuing  basis 
throughout  the  simulation  run  via  periodically  frequent  updates  on  each  model  module. 

For  each  of  the  three  tasks  listed  above  the  analyst  will  have  the  ability  to  alter  the 
frequency  with  which  individual  model  modules  are  inspected  so  as  to  improve  the  run 
efficiency  of  the  integrated  model  as  well  as  the  accuracy  level  of  the  model  estimates.  That 
is,  if,  for  a  particular  module,  the  analyst  determines  that  it  only  needs  to  “view”  the  module 
every  26t  time  units  instead  of ,  say,  the  original  8t  time  units  then  that  particular  module 
efficiency  will  be  improved.  Similarly,  if  obtaining  data  from  a  particular  model  module  every 
6t  time  units  yields  insufficient  accuracy  for  a  specific  estimate,  then  the  model  analyst  will 
be  able  to  increase  the  frequency  at  which  the  model  data  is  generated  to,  say,  .5St  time 
units;  thereby  improving  estimator  accuracy.  In  other  words,  one  may  look  upon  the  process 
of  obtaining  a  data  point  from  a  lower- level  model  as  taking  a  “snapshot”  of  that  model 
and,  at  that  point  in  time,  retreiving  the  desired  information.  This  process  in  repeated  in 
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Figure  3:  Data  Flow 
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order  to  accumulate  a  database  from  that  model.  The  quality  of  the  statistical  estimates 
constructed  from  that  database  depends,  in  large  part,  upon  when  and  how  frequently  the 
data  where  collected.  Consequently,  giving  the  model  analyst  the  power  to  adjust  when 
the  data  are  collected  will  allow  him  to  adjust  for  the  quality  of  the  desired  estimates.  We 
anticipate,  and  recommend,  that  understanding  and  clearly  identifying  all  of  these  roles  that 
the  model  analyst  will  play  in  the  integrated  modeling  process  will  be  a  critical  part  of  the 
overall  modeling  effort.  Indeed,  the  model  analyst’s  role  in  the  integrated  model  is  the  most 
important  element  for  it  will  directly  determine  the  quality  of  the  entire  simulation.  Thus,  it 
is  imperative  that  this  component  of  the  integrated  model  be  fully  researched  and  understood. 
Currently,  Rome  Laboratories  has  devoted  little  effort  to  this  portion  of  the  integrated  model 
effort.  In  fact,  this  aspect  of  the  integrated  model  is  new  to  the  simulation  community  too 
and  very  little  work  has  been  done  in  this  area.  Consequently,  we  believe  that  it  is  very 
important  to  use  the  analysis  framework  so  as  to  provide  a  sound  and  scientific  means  of 
guiding  Rome  Laboratories  in  piecing  together  the  many  parts  of  the  integrated  simulation 
model.  Next,  we  discuss  how  the  statistical  estimation  processes  should  be  conducted. 

3.  Statistical  Estimation 

In  this  section  we  outline  the  statistical  estimation  procedures  within  the  integrated,  hierar¬ 
chical  simulation  model  framework.  Identification  and  use  of  proper  statistical  estimates  is 
critical  to  obtaining  a  clear  understanding  of  the  integrated  model  results. 

There  are  two  types  of  statistical  estimation  problems  that  will  be  encountered  by  the 
integrated  simulation  model.  They  are:  (a)  point  and  confidence  interval  estimation  for  a 
given  output  population  y,jk  and  (b)  regression  model  (metamodel)  estimation  for  a  given 
lesponse  set  of  responses  ijjk  and  their  associated  sets  of  inputs  x,jk  and  u\jk-  Both  types  of 
estimation  problems  need  to  be  considered  for  univanatt  and  multivariate  cases  since  both 
may  be  encountered  at  any  model  module  in  the  integrated  model.  Here,  we  consider  only  the 
univariate  estimation  type  problems  and  recognize  that  many  of  the  steps  outlined  herein 
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can  be  adapted  to  the  multivariate  case.  Properly  indentifying  the  necessary  procedures 
for  conducting  statistical  estimation  in  the  simulation  context  will  help  to  ensure  proper 
interpretation  of  the  results  and  more  accurate  measures  of  performance. 

Unfortunately,  statistical  estimation  (rarely  an  easy  problem  in  itself)  is  made  more 
difficult  when  applied  to  the  simulation  context.  This  is  the  result  of  the  following  conditions: 

(a)  the  sequence  of  outputs,  y,  obtained  from  a  simulation  run  are  often  correlated  and 

(b)  the  sequence  of  outputs,  y,  obtained  from  a  simulation  run  are  often  not  identically 
distributed.  Under  these  two  conditions  statistical  estimation  becomes  extremely  difficult. 
However,  there  are  some  remedies  that  have  been  developed  in  the  simulation  literature  that 
can  help  overcome  some  aspects  of  these  difficulties.  Unfortunately,  they  are  not  universally 
applicable  and  often  do  not  provide  a  consistent  measure  of  improvement  across  all  types  of 
simulation  situations. 

First,  we  consider  the  problem  of  estimating  the  mean  and  variance  of  the  populations 
of  the  responses  of  interest.  Running  a  computer  simulation  program  can  be  viewed  as  gen¬ 
erating  a  sample  of  observed  responses  values  (for  each  response  variable  y,;it).  Typically, 
the  characteristics  of  interest  for  this  population  are  the  mean  fi,jk  and  the  variance 
Because  of  the  problems  mentioned  at  the  beginning  of  this  section  with  regard  to  simu¬ 
lation  data,  the  estimation  of  these  two  population  parameters  cannot  be  done  well  using 
classical  statistical  estimation  techniques.  For  this  reason,  we  provide  the  following  outline 
for  conducting  statistical  estimation  for  each  y.jjt  within  the  integrated  model: 

•  For  each  response,  y.j*,  of  interest,  determine  the  effect  of  initialization  bias  on  the 
mean  This  would  entail  establishing  a  warm-up  period  for  each  model  module  in 
the  integrated  model  and  truncating  the  initial  “biased”  data  obtained  from  this  warm¬ 
up  period.  (Note  that  there  may  be  different  warm-up  periods  for  different  repsonses 
within  a  single  model  module.)  Proper  establishment  of  these  warm-up  periods  will 
help  to  ensure  that  the  the  estimates  of  the  will  be  unbiased.  Currently,  there  are 
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several  algorithms  for  determining  the  optimal  initial  warm-up  period  under  different 
conditions. 

•  Determine  how  “batching”  is  to  be  done  for  each  response  yijk  in  the  model  during  the 
simulation  run.  Identification  of  a  proper  batch  size  will  reduce  the  effects  that  the 
dependency  structure  present  in  the  sequence  of  observed  responses  and,  thus,  help  to 
ensure  unbiased  estimation  of  the  population  variances  As  with  the  initialization 
bias  problem  above,  there  may  be  different  recommended  batch  sizes  for  each  response, 

in  the  integrated  model.  There  are  several  algorithms  for  establishing  optimal 
batch  sizes  under  various  simulation  contexts. 

•  Establish  algorithms  for  determining  optimal  run  lengths  for  each  of  the  model  mod¬ 
ules  comprising  the  integrated  model.  This  would  necessarily  involve  utilizing  the 
estimation  information  obtained  from  both  the  estimation  of  and  crf^i^  for  all  re¬ 
sponse  populations  and  performing  a  sequentially  updated  set  of  comparisons  to  the 
established  accuracy  levels  indicated  by  the  model  analyst  prior  to  simulation.  Estab¬ 
lishment  of  such  a  set  of  algorithms  would  greatly  improve  the  efficiency  of  the  entire 
integrated  model. 

In  addition  to  these  three  primary  steps  to  performing  proper  estimation  during  the  sim¬ 
ulation,  rules  should  also  be  established  for  identifying  how  the  multiple  estimation  problems 
within  the  integrated  model  context  are  interrelated. 

Next,  we  provide  an  outline  for  conducting  regression  model  estimation  within  the 
integrated  model  context.  First,  we  introduce  some  basic  notation  to  help  clarify  the  issues. 

(.'unsidei  ti  simulation  experiment  consisting  of  m  design  points,  where  each  design  point 
is  identified  by  the  li-dimensional  vector  v?  of  design  variables  (factors)  that  are  deterministic 
inputs  to  the  simulation  model.  Let  the  univariate  response  from  the  pth  design  point 
be  denoted  by  ijg  and  let  the  vector  of  responses  from  all  m  design  points  be  denoted  by 
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y  =  (j/i)  2/2j  •  •  • )  Also,  let  (Pg  denote  the  settings  of  the  d  factors  for  the  ^'th  design  point 
and  let  {x/iO  :  h  =  1,2, ...  ,p  —  1}  represent  known  functions  of  the  factor  settings.  Then, 
assuming  that  the  relationship  between  the  response  and  the  given  functions  of  the  factor 
settings  is  linear  in  the  unknown  parameters,  we  can  write 

p-i 

=  i^o  +  51  l3hXh{<Pg)  +  Cg  for  p  =  1, 2, . . . ,  m,  (1) 

h=l 

where  {13^  :  h  =  0,l,...,p  —  1}  are  the  unknown  model  parameters  and  tg  represents 
the  inability  of  the  term  (/3o  +  ECi determine  y^.  Define  X  to  be  the 
m  X  p  design  matrix  whose  first  column  is  all  ones  and  whose  {g,h  +  1)  element  is  X)^[(pg) 
{g  =  1,2, ...  ,m  and  /i  =  1,2, ...  ,p  —  1).  Thus,  the  relationship  between  the  response 
and  the  functions  of  the  factor  settings  across  all  m  design  points  can  be  written  compactly 
as  the  following  general  linear  model: 

y  =  X/3  +  €,  (2) 

where /3  =  (/?o,  A, •  •  •  i/?p-i)'  and  e  =  (ci,e2.- •  •  ,Cm)'- 

If  the  {x;,}  are  chosen  such  that  X  is  orthogonal  then: 

X'X  =  mip.  (3) 

This  can  be  achieved  by  a  simple  reparameterization,  or  coding,  of  the  functions  of  the  factor 
levels. 

We  also  assume  that 


{y.:i  =  l,2,...,r}  IID  ~N,„(X/3,S),  (4) 

where  S  is  the  m  x  m  covariance  matrix  of  y,  (for  i  =  1,2,...,  r). 

for  our  purposes,  the  design  factors  indicated  in  the  above  discussion  would  be  com¬ 
prised  of  both  x,jk  and  w,jk  inputs.  Basically,  these  regression  models  would  be  used  to 
approximate  the  relationships  between  a  particular  response  variable  of  interest  at  a  given 
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model  module  and  the  set  of  input  factors  that  are  fed  into  that  model  module.  Knowledge 
of  such  a  regression  relationship  can  be  used  for  any  one  of  three  powerful  purposes:  (a) 
control,  (b)  prediction,  and  (c)  model  validation.  Thus,  the  regression  relationship  could  be 
used  to  predict  results  without  having  to,  necessarily,  perform  a  simulation  run.  It  could 
also  be  used  to  control  certain  modules  so  that  reasonable  outputs  estimates  could  be  ob¬ 
tained  for  them  without  having  to  run  that  particular  module  during  the  simulation  run  of 
the  integrated  model.  This  has  the  potential  of  fantcistically  reducing  the  needed  CPU  time 
for  a  particular  run  or  set  of  runs.  Lastly,  the  regression  model  could  be  used  to  help  the 
model  analyst  validate  portions  of  the  integrated  system.  All  of  these  uses  for  regression 
models  fitted  to  simulation  output  have  been  well  documented  in  the  literature.  However, 
to  date,  no  one  has  attempted  to  simultaneously  implement  all  such  methods  in  one  large 
scale  simulation  model  such  as  the  integrated  model  context  here. 

In  the  following,  we  list  the  recommended  steps  necessary  for  properly  conducting  a 
designed  simulation  experiment  for  the  purpose  of  estimating  a  regression  function  of  the 
type  indicated  in  equation  (1).  Again,  these  steps  would  have  to  carried  out  for  each  set  of 
inputs  and  the  associated  response  of  interest  for  all  model  modules. 

•  Determine  which  relationships  are  to  be  approximated  by  a  regression  model. 

•  Hypothesize  a  functional  form  for  each  model  of  interest. 

•  Determine  the  experimental  design  that  is  to  be  used. 

•  Validate  the  key  assumptions  made  in  equation  (4)  above. 

•  Estimate  the  unknown  parameters  in  the  model. 

•  Test  for  the  adequacy  of  the  fitted  model. 

Successful  completion  of  these  steps  will  provide  the  model  analyst  with  a  valid  regres¬ 
sion  approximation  that  may  be  used  for  any  of  the  three  purposes  above.  Currently,  it 
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remains  unclear  as  to  all  of  the  uses  such  regression  models  may  have.  This  aspect  should 
be  of  great  interest  to  the  model  analyst  and  has  the  potential  for  great  payoffs. 

In  this  section  we  have  given  an  outline  of  the  steps  necessary  in  order  to  conduct 
good  estimation  routines  within  the  model  modules  of  the  integrated  simulation  model.  The 
primary  goal  of  the  model  analyst  should  be  to  fully  integrate  these  steps  for  each  model 
module  into  the  larger  agenda  of  the  entire  integrated  model.  The  le.el  of  coordination 
required  to  accomplish  this  task  is  considerable  and  has  not  been  done  previously.  However, 
proper  definition  and  understanding  of  all  relevant  terms  as  well  as  a  clear  view  of  the  desired 
outcome  can  greatly  facilitate  in  bringing  it  about. 

4.  Variance  Reduction  Techniques 

In  this  section  we  provide  a  menu  of  variance  reduction  techniques  (VRT)  that  we  think  are 
essential  to  the  integrated  model  effort.  Variance  reduction  techniques  are  analysis  methods 
that,  under  certain  general  conditions,  can  greatly  reduce  the  variance  of  the  estimators 
used  in  a  simulation  model  without  adversely  affecting  other  r.spects  of  the  estimation  issue 
(such  as  estimator  bias).  Typically,  variance  reduction  techniques  do  not  involve  significant 
additional  computational  effort  and,  thus,  can  be  thought  of  as  gleaning  more  information 
from  the  simulation  run  v/ithout  extra  runtime  costs. 

Basically,  variance  reduction  techniques  include:  (a)  antithetic  variates,  (b)  control 
variates,  and  (c)  common  random  numbers.  Each  of  these  methods  involves  inducing  cor¬ 
relations  of  a  prescribed  sign  between  the  responses  from  a  simulation  model.  The  idea  is 
10  use  this  induced  correlation  to  further  improve  the  estimation  of  the  population  char¬ 
acteristic  (mean  or  variance)  of  interest.  Typically,  researchers  have  focused  attention  on 
the  implementation  of  these  techniques  to  single  simulation  runs.  However,  recent  attention 
has  begun  to  focus  on  utilizing  them  across  multiple  run  simulation  experiments.  This  is  of 
particular  relevance  to  the  integrated  simulation  model  for  two  reasons.  First,  by  nature, 
the  integrated  simulation  model  will  involve  multiple  simulation  runs  from  the  various  model 
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modules  that  comprise  the  integrated  model.  Second,  many  variance  reduction  applications 
have  suggested  that  their  effectiveness  becomes  enhanced  for  large-scale  simulation  models. 
Clearly,  the  integrated  simulation  model  will  be  a  large-scale  simulation  model  and,  hence, 
the  potential  will  exist  for  reaping  large  benifits  from  the  application  of  these  techniques  to 
the  integrated  model  evironment. 

Consider  the  simple  simulation  experiment  discussed  in  the  previous  section,  where  r 
represents  a  random  sample  from  the  uniform  distribution  on  the  unit  interval  [0,1].  For  a 
single  replication  of  the  basic  m-point  experimental  design,  we  represent  the  complete  set 
of,  say  k  random  number  streams  in  the  following  way:  (a)  the  sequence  of  random  numbers 
available  from  the  jth  stream  at  the  gth.  design  point  is 

(5  =  1, 2, . . . ,  m  and  j  =  1,2, . . . ,  fc);  (b)  the  set  of  streams  at  the  ^th  design  point  is 

~  •  •  •  )  ^gn)  (5) 

{g  =  l,2,...  ,m);  and  (c)  the  aggregate  random  number  input  for  the  basic  m-point  experi¬ 
mental  design  is 

R*°  =  (Rx,R2,...,R,„). 

Now  at  the  ^th  design  point,  Rj  completely  determines  the  events  of  the  associated  simula¬ 
tion  run  so  that  we  can  write 

Vgi^g)  =  +  +  e(Rs)-  (6) 

h=l 

In  conducting  a  simulation  experiment,  the  analyst  must  assign  a  particular  set  of 
random  number  streams  Rg  to  each  design  point  g  {g  =  l,2,...,m).  Three  conventional 
assignment  strategies  are:  (a)  independent  streams  {the  method  of  independent  replications), 
(b)  common  streams  ylhe  method  of  common  random  numbers),  and  (c)  antithetic  streams 
(the  method  of  antithetic  variates).  One  strategy  that  assigns  boih  antithetic  and  common 
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random  number  streams  to  the  design  points  is  described  in  the  following.  It  has  been  shown 
to  be  optimal  for  a  large  class  of  simulation  experiments. 

Assume  that  the  design  matrix  X  is  orthogonally  blockable  into  two  blocks.  That  is, 
there  exists  an  m  x  2  matrix  W  of  zeros  and  ones  such  that:  (a)  design  point  g  is  in  block  j 
if  and  only  if  Wgj  =  1  (^r  =  1,2, . . .  ,m,  and  j  =  1,2);  (b)  the  columns  of  W  are  orthogonal 
to  the  columns  of  T  so  that  T'W  =  (0p_i,0p_i);  and  (c)  we  have  lJ,iW  =  (mi, m2),  where 
mi  and  m2  are  positive  integers  respectively  representing  the  size  of  each  block  so  that 
mi  +  m2  =  m.  For  this  situation  we  may  use  the  following  assignment  rule: 

If  the  m-point  experimental  design  admits  orthogonal  blocking  into  two  blocks  of 
sizes  mi  and  m2,  preferably  chosen  to  be  as  nearly  equal  in  size  as  possible,  then 
for  all  mi  design  points  in  the  first  block,  use  a  set  of  pseudorandom  numbers 
R  =  (ri,r2,...),  chosen  randomly,  and  for  all  m2  design  points  in  the  second 
block,  use  the  corresponding  set  of  complementary  pseudorandom  number?  .  -- 
(1  -ri,l  -r2,...). 

it  has  been  shewn  that  this  strategy  yiela^  optimal  results  when  the  block  sizes  are 
equal;  that  is,  mi  =  m2.  This  strategy  involves  the  decomposition  of  the  error  term  ^(Rj)  at 
the  ^th  design  point  into  a  random  block  effect  ^(Rj)  and  a  residual  ^"(Rj),  both  of  which 
are  functions  of  R^.  Thus,  the  metamodel  in  (1)  can  be  written  as 

p-i 

yjCRj)  =/3o  + +  +  for  5  =  l,2,...,m,  (7) 

h=l 

to  emphasize  the  dependence  of  the  random  components  in  this  metamodel  on  the  random 
input  Rj.  In  addition,  we  make  the  following  assumptions  about  the  decomposition  of 
the  metamo>'el  error  terms  in  equation  (7):  (a)  the  errors  {e(R5)}  have  mean  zero  and  a 
constant  variance  cr^  across  all  design  points;  (b)  the  block  effects  {6(Rj)}  have  mean  zero 
and  are  not  correlated  with  the  residuals  {€'’(R3)};  (c)  the  residuals  fe°(Rp)}  have  mean 
zero  and  a.e  pairwise  uncorrelated;  (d)  there  is  a  nonnegative  correlation  between  twu  block 
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effects  obtained  with  common  random  numbers;  and  (e)  there  is  a  nonpositive  correlation 
between  two  block  effects  obtained  with  antithetic  (complementary)  random  number  streams. 
These  assumptions  imply  the  following  basic  properties  of  the  Schruben- Margolin  correlation- 
induction  strategy: 

1.  The  response  variance  is  constant  across  all  points  in  the  design  so  that 

aj  =  var[t/(Rg)]  =  0-2  for  y  =  (8) 

2.  There  is  a  constant  nonnegative  correlation  between  all  pairs  of  responses  and  y^ 
(y  ^  k)  that  are  realized  from  a  common  random  number  stream  R.  Thus  if  Rj  = 
Rfc  =  R,  then 

corr[yg(R),y*:(R)]  =  pi  for  1  <  y,  A:  <  m  and  g  ^  k^  where  0  <  pi  <  1.  (9) 

3.  There  is  a  constant  nonpositive  correlation  between  all  pairs  of  responses  y^  and  yjt 
(y  7^  ';)  that  are  realized  from  antithetic  random  number  streams  R  and  R  respectively. 
Thus  if  Rj  =  R  and  R*;  =  R,  then 

corr[yg(R),yii.(R)]  =  for  1  <  y,/;  <  m  and  g  ^  k,  where  -  1  <  ^2  ^  0-  (10) 

Then,  under  these  assumptions,  with  equal  block  sizes,  the  metamodel  in  (2)  takes  the 
following  form: 

y(R"*)  =  X/3  4- WB(R°*)  +  e®(R°*),  (11) 

vvhere:  y  =  y  =  mj  =  m2  is  the  corrxmon  block  size;  B(R°*)  =  [6i(R°*), 62(R°*)]'  is  the 
2x1  vector  of  random  block  effects;  W  is  the  m  x  2  block  incidence  matrix-,  and  e°(R°*)  is 
the  m  X  1  vector  of  residual  errors.  Note  that  within  each  block,  we  assume  a  common  block 
effect  that  does  not  depend  on  the  design  point.  Let  X,  represent  the  y  x  p  design  matrix 
for  the  jih  block  [j  =  1,2).  If  the  experimental  points  are  so  arranged  that  X  =  (X'i,X2)', 
then  we  have 

W  _  1^9 

[0,  1, 
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where  each  column  of  W  contains  9  =  y  ones.  With  the  assumptions  given  above,  we  have 

Cov(B)=A<’'  H.  (13) 

]_  P2  P\ 

Expressions  (12)  and  (13),  together  with  the  assumptions  stated  above,  result  in  the 
following  positive  definite  covariance  structure  for  y; 


where  det(Sy"‘)  ^  0,  and  Ej"  ute  q  x  q  matrices. 

The  following  theorem  pertains  to  simulation  experiments  performed  under  the  strategy 
described  above: 

Theorem  1:  If  an  experimental  design  admits  orthogonal  blocking,  and  if  the 
assumptions  of  (8),  (9),  and  (10)  hold,  then  under  the  assignment  rule  the  ordi¬ 
nary  least-squares  estimator  of  /3  has  a  smaller  generalized  variance  than  it  has 
under  the  following  strategies;  (a)  the  assignment  of  one  common  set  of  random 
number  streams  to  all  design  points,  or  (b)  the  assignment  of  independent  sets 
of  random  number  streams  to  each  design  point,  provided 

[l-f  (m  -  1)/Ji  -  (2mim2(m"*))(/?i  -  /)2)](1  -  P2f  <  1, 

in  the  latter  case. 

Corollary  1:  Under  the  assumption  of  Theorem  1,  the  assignment  rule  is  supe¬ 
rior  to  the  use  of  ^ohiiuon  random  numbers  in  estimating  /9oi  and  the  two  strate¬ 
gies  are  equivalent  in  terms  of  dispersion  for  estimating  (3^  =  (/?!,  /?2)  •  -  •  > 


14-20 


When  compared  to  the  use  of  independent  random  number  streams  at  each  point, 
both  the  assignment  rule  and  common  random  numbers  are  superior  in  terms  of 
dispersion  for  estimating  (5-^.  Thus,  this  strategy  is  an  efficient  means  of  com¬ 
bining  the  two  correlation  methods  of  common  random  numbers  and  antithetic 
variates  for  a  large  class  of  experimental  designs. 

The  presentation  given  above  indicates  how  some  variance  reduction  techniques  can  be 
successfully  applied  across  multiple  simulation  runs  which  constitute  a  simulation  experi¬ 
ment. 

In  the  context  of  the  integrated  model,  the  model  analyst  must  perform  many  simula¬ 
tion  experiments  simultaneously  (at  least  one  multiple  run  experiment  per  model  module). 
Consequently,  the  model  analyst  should  focus  much  attention  on  how  to  successfully  design 
and  implement  multiple  run  vaiiance  reduction  techniques.  To  that  end  we  give  the  following 
steps  outlining  what  must  be  done, 

•  Identify  all  random  components  in  each  model  module. 

•  Utilize  random  number  generators  that  facilitate  the  synchronized  use  of  ail  random 
numbers  in  every  model  module  across  the  entire  integrated  model. 

•  Select,  for  each  regression  model  that  is  to  be  estimated  (see  Section  3)  and  in  conjunc¬ 
tion  with  the  design  matrix,  a  random  number  assignment  strategy  so  as  to  maximize 
model  estimator  performance. 

»  For  each  response  of  interest,  select  an  optimal  set  of  concomitant  set  of  variables 
(control  variates)  that  are  highly  correlated  with  the  response.  These  “control  variates” 
are  then  used  to  “explain  away”  or  “control”  some  of  the  variability  in  the  original 
response. 

•  Combine  the  use  of  the  two  techniques  indicated  by  the  previous  two  items. 
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Successfully  impelementing  these  steps  will  yield  superior  performance  throughout  the 
integrated  model.  Unfortunately,  not  all  of  these  items  have  completely  prescribed  measures 
for  being  corried  out.  In  addition,  the  model  analyst’s  problems  are  further  complicated 
when  consideration  is  given  to  coordinating  these  various  variance  reduction  steps  from  one 
model  module  to  another.  Nevertheless,  enough  is  known  about  these  variance  reduction 
techniques  to  indicate  that  unparalleled  benefits  from  fully  integrating  them  throughout  the 
entire  integrated  model  is  assured.  Currently,  Rome  Laboratories  has  not  begun  this  phase 
of  the  integration  effort. 

5.  Summary 

Throughout  this  document  we  have  offered  what  we  recommend  as  an  effective  outline  for 
conducting  analysis  on  the  integrated  model.  Many  of  the  individual  items  listed  have  been 
e.Ktensively  developed  in  the  simulation  literature.  However,  little  work  has  been  done  on 
the  development  of  integration  methods  for  these  items  in  one  simulation  model.  However, 
the  integrated  simulation  model,  perhaps  because  of  its  sheer  size,  demands  that  careful 
consideration  be  given  to  this  issue  of  method  integration.  We  believe  that  adherence  to 
the  items  listed  throughout  this  document  will  provide  the  best  means  of  accomplishing  a 
truly  integrated  simulation  model  at  Rome  Laboratories.  Furthermore,  we  project  that  a 
successful  implementation  of  all  of  the  items  mentioned  herein  would  take  on  the  order  of 
2-5  man-years  to  complete. 
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HIGH  PERFORMANCE  MICROSTRIP  ARRAYS 
FOR  POLARIMETRIC  BISTATIC  RADAR(PBR) 

APPLICATIONS 


Marat  Davidovitz 
Assistant  Professor 
Department  of  Electrical  Engineering 
University  of  Minnesota 

Abstract 

Novel  approaches  to  Polarimetric  Bistatic  Radar  antenna  design  were  investigated 
with  the  goal  of  improving  such  parameters  as  cross-polarization  and  isolation.  A 
feeding  arrangement  for  linear  microstrip  arrays,  which  facilitates  compact  array  con¬ 
figurations,  was  proposed.  A  design  methodology  was  developed  to  implement  the 
developed  designs.  A  small-scale  prototype  was  built  and  tested  in  order  to  verify  the 
viability  of  the  new  concepts.  Preliminary  calculations  were  carried  out  to  design  a 
full-scale  array  for  the  PBR  experimental  facility  at  the  RL,  Hanscom  AFB. 
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1  Introduction 


Antenneis  used  for  polarimetric  radar  measurements  are  required  to  satisfy  a  unique  combi¬ 
nation  of  criteria.  Among  these  are  low  cross-polfirization,  high  isolation  between  orthogonal 
polarization  channels,  large  azimuthal  resolution  and  low  side  lobes.  Moreover,  mechanical 
considerations  may  present  additional  constraints  on  the  size  and  weight  of  the  antenna  struc¬ 
ture.  This  may  be  particularly  important  for  bistatic  measurement  configurations,  wherein 
the  antennas  may  be  re-deployed  many  times  in  the  course  of  a  measurement  run. 

The  maturing  microstrip  antenna  technology  may  afford  the  solution  to  the  problems 
posed  by  polarimetric  radar.  Arrays  of  microstrip  elements  can  provide  the  required  polar¬ 
ization  and  beam  capabilities.  Microstrip  designs  also  have  the  added  advantages  of  being 
small(thin),  light-weight,  and  relatively  inexpensive. 

The  primary  task  described  in  this  report  concerns  the  design  of  a  receiving  microstrip 
antenna  for  a  bistatic  polarimetric  radar  system.  The  work  was  carried  out  at  the  Rome 
Laboratories  ERCE  section,  under  the  auspices  of  the  AFOSR  Summer  Faculty  Program. 

The  following  guidelines  were  issued  for  the  design: 

•  Frequency:  Center  fe  =  3.200GHz,  Baud  Width  BW  «  6%. 

•  Polarization:  Dual  Orthogonal  Linear  (H,V) 

-  Cross-Polarization  <  20dB 

-  Orthogonal  Port  Isolation  >  25dB 

•  Patterns 

-  3dB  Beam  Width  HPBW  <  6® 

-  Side  Lobe  Level  SLL  <  -20dB 

The  proposed  approach  entails  the  design  of  a  microstrip  array  containing  square  patch 
elements.  The  array  is  to  be  fed  by  a  hybrid  microstrip/coaxial  feed  network  incorporating 
an  illumination  taper  required  to  achieve  the  specified  SLL.  The  design  tasks  which  have 
been  carried  out  to  date  are: 

•  Design  of  the  single  element  for  the  proper  resonant  frequency 

•  Design  of  a  2x2  low-cross  polarization  sub-array  module,  which  will  form  the  basic 
building  block  for  the  final  structure 
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•  Design  of  the  feed  network,  including  the  proper  amplitude  taper  for  the  specified  SLL 
0  Analysis  of  r<indom  phase  error  effects  on  the  SLL 

0  Photographic  mask  layout  of  the  preliminary  design  for  the  purpose  of  experimental 
verification 

The  listed  tasks  are  described  in  more  detail  in  the  subsequent  sections  of  the  report. 
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Single  Element  Design 

There  exist  numerous  methods  for  analyzing  the  properties  of  single  element  patch  anten¬ 
nas.  They  range  from  the  analytically  simple  first  order  solutions  to  the  rigorous,  full-wave 
methods,  requiring  extensive  computation.  The  latter,  if  properly  executed,  yield  the  most 
accurate  results. 

In  order  to  obtain  a  successful  design  in  as  few  iterations  as  possible,  a  numerical  code 
based  upon  a  full-wave  electromagnetic  analysis  was  utilized  to  find  the  patch  dimension 
flfor  the  specified  resonant  frequency  (f^  =3.20GHz)  and  the  given  substrate  permittivity 
and  thickness,  as  well  as  to  compute  the  input  impedance  of  the  patch  over  the  frequency 
band  of  interest.  The  analytical  details  have  been  documented  in  previous  publications 
[1],[2],  [3].  The  last  reference  contains  a  polynomial  expression  for  the  resonant  frequency  of 
a  rectangular  patch  of  a  given  dimension.  The  expressions  are  reported  to  be  valid  over  a 
wide  range  of  patch  aspect  ratios  and  substrate  permittivities.  Although  these  expressions 
are  not  directly  useful  in  design,  they  can  be  inverted  by  means  of  root  searching  procedures. 
Such  an  algorithm  was  implemented  in  the  Fortran  program  attached  in  the  Appendix. 

The  design  documented  in  this  report  was  somewhat  constrained  by  the  availability  of 
substrate  material.  Based  upon  bandwidth  considerations,  use  of  low  dielectric  constant, 
thick  substrates  is  desirable.  As  clearly  illustrated  in  Figure  1,  for  a  fixed  dielectric  constant 
(in  this  case  c,  =2.33)  the  bandwidth  is  almost  a  linear  function  of  the  substrate  thickness. 
The  preliminary  design  was  carried  out  for  a  substrate  characterized  by  c,  =2.33  and  thick¬ 
ness  f=3.i75mm,  which  was  the  most  suitable  material  available  locally.  Referring  back  to 
Fig.  1  it  is  seen  that  the  estimated  BW  for  a  2:1  input  SWR  is  approximately  4%.  To 
achieve  greater  bandwidth,  a  substrate  which  either  has  a  lower  Cr  or  is  slightly  thicker  needs 
to  be  utilized. 

The  resonant  dimension  of  the  square  patch  was  calculated  with  the  aid  of  the  afore¬ 
mentioned  Fortran  code.  For  the  chosen  substrate  (cr  =2.33,<=3.175nun)  this  dimension  is 
a=27.9mm.  This  information  was  subsequently  fed  into  a  full-wave  code  to  calculate  the 
input  impedance  for  the  edge-fed  squeire  patch  with  a=27.9nun.  The  feed  point  was  cen¬ 
tered  between  the  patch  comers.  The  results  are  shown  in  Figure  2.  The  displayed  data 
are  normalized  with  500.  As  expected,  the  impedance  locus  crosses  the  real  axis  at  3.2GHz. 
The  real  part  of  the  input  impedance  at  that  frequency  is  Eg  =  2440. 


2.1  Low  Cross-Polarization  Four-Element  Module 

The  single  element  square  patch  antenna  ordinarily  exhibits  a  high  level  of  cross-polarization. 
This  is  primarily  due  to  the  presence  of  impurity  fields  associated  with  higher-order  modes 
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excited  by  the  feed  line.  Several  means  of  suppressing  the  high-order  modes  have  been 
developed  in  the  past  [4],  [5].  In  the  case  of  the  single  element  antenna,  the  dominant 
impurity  modes  can  be  eliminated  by  oxciting  the  patch  at  the  opposite  sides  with  180® 
difference  between  the  two  ports.  Alternatively,  one  can  implement  the  same  idea  in  the  form 
of  a  four  single-port  element  sub-array  module.  In  this  configuration  the  four  elements  are 
paired  and  the  two  pairs  are  fed  at  the  opposite  sides  180®  out  of  phase.  In  this  arrangement 
the  fields  associated  with  the  JS02  modes,  which  are  the  primar'^  contributors  to  the  cross¬ 
polarization,  approximately  cancel  each  other  in  the  far  field.  At  the  same  time,  the  Eio-mode 
fields  radiated  by  all  four  patches  will  reinforce  each  other  to  form  the  desired  co-polarized 
field. 

Shown  in  Figure  3  is  a  four  patch  sub-array  designed  in  accordance  with  the  outlined 
principles.  This  specific  layout  was  produced  for  the  =2.33,  t=3.175rr  n  substrate  and  the 
resonant  frequency  f^  =3.2GHz.  The  feed  lines  between  the  50n  coaxial  connector  and  the 
244fl  microstrip  patch  input  port  are  ISSQ,  lOOfl  and  70.711. 

2,2  Dual  Linear  Polarization  Array  Layout 

The  most  straightforward  approach  one  can  take  to  produce  two  orthogonal  linear  fields  of 
high  purity  is  to  use  two  separate  four  element  modules  described  in  the  preceding  section. 
The  two  sub-arrays  would  have  to  be  positioned  at  90®  with  respect  to  one  another.  This 
arrangement  is  inefficient  from  the  standpoint  of  substrate  area  utilization.  In  order  to 
improve  upon  this  design  it  is  necessary  to  use  the  same  set  of  four  patches  to  transmit  both 
polarizations.  Thus  each  patch  will  be  fed  at  two  ports  to  produce  orthogonal  modes  in  the 
element  ‘cavity’  and  orthogonal  polarizations  in  the  far  field.  Although  this  is  possible  in 
principle,  the  actual  implementation  is  complicated  by  the  fact  that  two  non- intersecting 
feed  networks  are  to  be  positioned  in  a  limited  area  around  the  four  patches.  The  limits 
of  this  area  are  determined  by  the  spacing  of  the  elements  in  the  overall  array.  A  possible 
arrangement  which  may  solve  the  layout  problem  for  the  linear  array  under  consideration  is 
shown  in  Figure  4.  In  the  proposed  geometry,  each  vertical  (with  respect  to  the  line  of  the 
complete  array)  pair  of  elements  forms  a  part  of  two  orthogonally  polarized  modules.  Thus 
constructed,  the  dual  polarized  array  occupies  the  same  substrate  area  as  would  be  needed 
to  support  an  array  of  single  polarization  modules. 

There  is  a  trade-off  between  the  physical  compactness  of  the  array  and  the  level  of 
isolation  between  the  orthogonal  polarization  input  ports.  Use  of  two  completely  separate 
four  element  sub-arrays  to  radiate  the  Co-  and  X-Pol  fields  yields  the  highest  level  of  isolation. 
Reported  results  [6]  indicate  that  the  signal  in  the  orthogonal  port  for  such  an  arrangement 
is  typically  more  than  40dB  below  the  input  level.  In  the  more  compact  arrangements. 
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Table  1:  Input  Am] 
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10 

the  proximity  of  the  various  feed  lines  belonging  to  the  orthogonally  polarized  sub-arrays 
is  expected  lo  lead  to  stronger  cross-coupling  of  the  two  input  ports.  A  more  reasonable 
estimate  for  the  isolation  level  in  this  case  is  25-30dB. 

2.3  Radiation  Patterns 

Among  the  performance  parameters  specified  for  the  proposed  antenna  are  the  radiation 
pattern  Half-Power  Beam  Width  (HPBW)  and  Side  Lobe  Level  (SLL)  in  the  azimuthal 
plane.  Since  the  down  range  resolution  is  controlled  by  the  radar,  the  elevation  pattern  can 
be  broad.  Therefore  a  linear  array  geometry  would  serve  the  purpose.  To  achieve  the  required 
HPBW  =  6®  2uid  SLL  =  20dB  a  proper  number  of  elements  and  a  suitable  input  weight 
distribution  have  to  be  selected.  Prelimiriiry  estimates  had  indicated  that  approximately 
16  elements  spaced  d=0.6Ao  apart  should  ;>roduce  the  desired  HPBW.  Equivalently,  8  four 
element  modules  of  the  type  described  in  •  .le  preceding  sections  can  be  used. 

To  achieve  the  required  SLL  a  tapere*;  element  coefficients  distribution  is  required.  An 
initial  choice  for  the  input  coefficients  was  based  on  a  30dB  Chebyshev  design  for  an  8 
element  array.  The  taper  can  be  impleminted  by  means  of  attenuators  inserted  into  the 
coaxial  feed  lines  carrying  the  signal  from  the  power  divider  to  the  individual  sub-arrays. 
The  array  coefficients  obtained  from  the  initial  Chebyshev  design  corresponded  to  non¬ 
integral  values  of  attenuation.  Since  fixed-\  due  attenuators  are  typically  available  in  steps 
of  IdB,  it  would  be  considerably  simpler  U'  have  an  input  amplitude  distribution  which 
can  be  realized  with  commercially  available  units.  Using  the  initial  Chebyshev  design  as  a 
guide,  an  iterative  simulation  procedure  was  carried  out  in  order  to  obtain  an  illumination 
distribution  quantized  in  steps  of  IdB.  The  final  design,  summarized  in  Table  I,  yielded  a 
theoretical  radiaUon  pattern  in  which  the  SLL  did  not  exceed  24dB.  The  azimuthal  patterns 
of  a  linear  array  of  8  four-element  modules  spaced  O.GAq  apart  and  excited  with  the  proposed 
amplitude  taper  are  shown  in  Figure  5.  Both  the  horizontal  and  the  vertical  polarization 
patterns  are  displayed.  It  is  clear  from  these  plots  that  the  worst  side  lobe  amplitude  is  below 
24dB  for  either  polarization.  This  is  4dB  lower  than  the  specified  value.  This  margin  of  safety 
is  important  in  offsetting  .several  potential  error  sources  which  were  not  explicitly  accounted 
for  in  the  design  process,  and  which  may  bring  the  SLL  up  in  the  actual  antenna.  The  most 
important  of  these  are  phase  errors  in  the  array  input  signals  and  spurious  radiation  from  the 
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microstrip  feeding  lines.  The  effects  of  the  latter  are  quite  difficult  to  estimate  analytically. 
In  order  to  gauge  the  effects  of  random  phase  errors,  the  patterns  of  the  array  recomputed 
with  randomly  generated  phase  error  distributions.  A  Gaussian  random  number  generator 
was  used  to  produce  the  input  phase  distributions.  The  results  of  simulations  with  various 
types  of  random  phase  errors  are  shown  in  Figures  6-9.  The  first  two  of  these  show  the  H- 
and  V-Pol  patterns  obtained  with  a  Gaussian  phase  error  with  zero  mean  and  5®  standard 
deviation.  The  consequence  of  this  induced  error  is  a  2dB  deterioration  in  the  SLL,  placing 
the  worst  side  lobe  amplitude  at  approximately  22dB.  Therefore,  this  error  magnitude  is 
tolerable.  How'ever,  as  Figures  8,9  illustrate,  when  the  standard  deviation  of  the  random 
error  is  increased  to  10®,  the  SLL  exceeds  the  allowed  level.  This  leads  to  the  conclusion 
that  the  accuracy  of  the  fabrication  and  assembly  procedures  must  be  sufficient  in  order  to 
ensure  phase  errors  smaller  than  approximately  5®. 
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3  Preliminary  Test  Results 

The  modules  shown  in  Figs.  3,4  have  been  fabiicated  and  tested  on  an  HP8510  Network 
Analyzer  to  ascertain  such  parameters  as  the  resonant  frequency,  impedance  bandwidth  and 
in  the  case  of  the  two-input-port  sub-array  in  Fig.  4  the  interport  coupling.  The  measured 
frequency  dependence  of  the  various  parameters  is  illustrated  in  Figures  10-13. 

The  input  SWR  for  the  four-element  array  in  Fig.3  was  measu»-ed  in  order  to  find  the 
resonant  frequency  and  the  bandwidth.  The  network  analyzer  output  is  attached  in  Figure 
10,  whence  it  is  seen  that  the  measured  resonant  frequency  is  3.215  GHz,  and  the  2:1  SWR 
bandwidth  is  130  MHz.  The  discrepancy  between  the  experimental  and  the  specified  value  of 
fc  is  on  the  order  of  0.5%.  The  BW  corresponds  very  closely  with  the  theoretical  predictions. 
The  input  SWR  at  the  ports  of  the  two  module  array  shown  in  Fig.  4  behaves  in  the  Scime 
fashion  as  in  the  isolated  module  case.  The  measured  results  are  given  in  Figs.  11-12. 

The  interport  coupling  in  the  two-port  sub-array  represents  a  very  important  perfor¬ 
mance  parameter.  Any  signal  coupled  from  port  one  into  port  two  may,  if  the  later  port  is 
unmatched,  reradiate  energy  into  the  undesired  polarization.  Minimization  of  this  coupling 
mechanism  constituted  an  important  design  consideration.  The  measured  intercoupling  data 
are  displayed  in  Figure  13.  At  the  center  frequency  of  the  design  band  the  coupled  signal 
is  on  the  order  of  -31dB.  At  the  band  edges  it  is  slightly  higher,  -25dB  and  -27dB,  respec¬ 
tively.  It  may  therefore  be  concluded  that  the  executed  design  very  effectively  isolates  the 
two  orthogonal  polarization  ports. 

Co-  and  X-Pol  radiation  patterns  of  the  two-module  sub-array  were  measured  for  both 
the  H  and  the  V  polarizations.  The  patterns  taken  at  the  resonant  frequency  are  shown 
in  Figures  22-25.  Figure  22,23  represent  the  Co-pol,X-pol  patterns,  respectively,  obtained 
when  the  H-port  is  excited.  The  X-pol  level  for  this  case  is  20  dB.  The  V-port  Co-  and  X-pol 
patterns  are  shown  in  Figs.  24,25,  respectively.  In  this  case  the  the  cross-polarized  field  is 
approximately  17db  lower  than  the  co-polaxized  components.  This  value  somewhat  exceeds 
the  specified  20  dB  X-pol  target.  The  primary  reason  for  the  discrepancy  is  the  proximity 
of  the  networks  feeding  the  H  and  V  ports.  Power  from  either  feed  tends  to  couple  to  the 
other  network  and  eventually  is  reradiated  into  the  X-pol  component.  Several  simple  ways 
for  alleviating  this  problem  can  be  suggested.  The  simplest  would  require  the  inter-element 
spacing  to  be  increased  from  the  current  value  of  O.OAq  to  0.7Ao  or  greater,  thereby  loosening 
the  coupling  between  the  feeds.  Another  approach,  one  which  would  result  in  the  least 
amount  of  coupling  between  the  two  feeds,  would  require  the  complete  physical  separation 
of  the  arrays  for  the  H  and  V  polarizations.  These  alternate  arrangements  will  be  evaluated 
experimentally  in  the  follow-up  design. 
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Bandwidth  (%) 


Figure  2:  Calculated  input  impedance  for  a  square  patch  antenna:  a=27.9mm, 
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polarization  module. 


Figure  4:  A  compact  feeding  arrangement  for  dual  polarized  array. 
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Figure  5:  Horizontal  and  Vertical  polarization  power  patterns. 
( - - — H-POL, . V-POL  ;  d=0.6Ao  ) 
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Figure  6:  H-POL  patterns: -  no  phase  error, . Gaussian  phase  error  with  0® 

mean,  5®  standard  deviation. 
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Figure  7:  H-POL  patterns: -  no  phase  error, . Gaussian  phase  error  with  0® 

mean,  10®  standard  deviation. 
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Figure  8:  V-POL  patterns;  - 
mean,  5®  standard  deviation. 


no  phase  error, . Gaussian  phase  error  with  0® 


Figure  9:  V-POL  patterns: -  no  phase  error, . Gaussian  phase  error  with  0® 

mean,  10®  standard  deviation. 
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Figure  10:  Input  SWR  for  the  isolated  4-element  module. 
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Figure  11:  Input  SWR  for  the  H-Pol  port  of  the  two-module  array. 
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Figure  12:  Input  SWR  for  the  y-Pol  port  of  the  two- module  array. 
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Figure  13:  Interport  coupling  in  the  two-module  array. 
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Figure  14:  Horizontal  co-polarized  radiation  pattern. 
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Figure  15:  Horizontal  cross-polarized  radiation  pattern. 
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Figure  16;  Vertical  copolarized  radiation  pattern. 
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Figure  17:  Vertical  cross-polarized  radiation  pattern. 
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oooooooooooooooooooooooooo 


c  c  c  c  c  c 


CCCCCCCCCCCCCCSCCCCCCCCC'CCCCCC 

THIS  PBOGRMi  KAS  USED  TO  DESZOS  K  iXitTA  EtEHEKT  “ODGLE  CF  SQOAaS  MICRO¬ 
STRIP  PATCH  AKTENNAS. 

INPllT  PARAMETERS: 

EPSR  -  SUBSTRATE  RELATIVE  PERMiTriVlTi 
HSOB  -  SUBSTRATE  THICKNESS 

fREQ  -  CENTER  FREQUENCY  IN  TSE  DESIGN  n.VRD,  I,E.  THE  PATCH 
RESONANT  FREQUENCY 

RIH  -  INPUT  RESISTANCE  OF  THE  PATCH  AT  F-FREQ  (EXTERNALLY 
COMPUTED) 

OUTPUT  DATA: 

A  -  THE  SQUARE  PATCH  DIMENSION 

LTRANS,HTRANS  -  LENGTH,  WIDTH  OF  THE  QUAR  TIZR-HAVE  TRAi:SFORMER 
USED  TO  HATCH  THE  PATCH  IMPEDAtJCE  TO  100  OHMS 
LDIVID,KDIVID  -  LENGTH,HIDTH  OF  THE  MATCHING  SECTION  IN  THE  POWER 
DIVIDER;  THIS  LINE  HAS  A  '/O.T  OHMS  CHARACTERISTIC 
IMPEDANCE 

WLINE  -  WIDTH  OF  THE  100  OHMS  LINE 

OFFSET  -  THE  OFFSET  DISTANCE  OF  THE  f.-3D  POINT;  REQUIRED  TO 
INTRODUCE  AN  180  DEGREE  PHA^f:  SHIFT  BETWEEN  THE  TWO 
SIDES  OF  THE  MODULE 

CCCCCCCCCCCCCCCCCCCCCCCC,  CCCCCCCCCCCC 
program  modul 

inpliclt  double  precision  (a-h,o-z) 

double  precision  ltrans,ldivid,laindaO 

cononon/releps  /epsr 
common/thick/hsub 

open  {unit«21, f ile-'modin' ) 

read(21,*) 

read (21,*)  epsr 

read(21,*) 

read(21,*)  hsub 

read(21,*) 

read (21,*)  freq 

read(21,*) 

read(21,*)  rin 

close  (21) 


lamdaO-3 . d8/f req 
zline-lOO. 

C  DETERMINE  THE  PATCH  DIMENSION  FOR  THE  GIVEN  RESONANT 
C  FREQUENCY 

call  patsiz (£req,a) 

C  CALCULATE  THE  QUARTER-WAVE  TRANSFORMER  PARAMETERS 

ztrans-sqrt (rin*zline) 
call  mspara  (ztrans,wohtr,epseft) 
ltrans-lamdaO/4 . /sqrt (epseft) 
wtrans-wohtr*hsub 

C  CALCULATE  THE  POWER  DIVIDER  MATCHING  SECTION  PARA>.:’  ,r,s 

zdivid-zline/sqrt (2 . ) 

call  msoara  (zdivid.wohdiv.eosefd) 
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ldivid*’laiDdaO/4 .  /sqrt  (epsefd) 
wditrid'Wobdiv^bSTib 

C  OOCOIATE  FEED  POIHT  OFFSET  I.M!BDA(100obai)/4. 

call  mspsra  (zline,wohlOO,epslOO) 
o££set>lsJt>daO/4 .  /  sgrt  CepslOO) 
wliiie-vcblOO*bsub 


open(anit' 
write (31, 
write (31, 
write (31, 
write (31, 
write (31, 
write (31, 
write (31, 
write (31, 
Write (31, 
write (31, 
write (31, 
close (31) 
end 


-31, £ile-'nodata' ) 

*)  'Square  Patch  Size  -  a-', a 
'Quarter-Wave  X-torroer' 

' 20-' , ztrans 

' length-' , Itrans , '  Width-' , wtrans 
'Power  Divider' 

'20-',zdivid 

'  Length—' ,  Idivid, '  Width-'  ,  wdivid 
-'Main  Feed  Line' 

'Z0-',zline 
'wline-' ,wline 
'Feed  Offset  -', offset 


*) 

*) 

*) 

*) 

*) 

*) 

*) 

*) 

*) 

*) 


subroutine  patsiz (£req,a) 
ijq>licit  double  precision  {a-h,o-z) 

consnon/ fdesir /£  reqO 
common/ releps/epsr 
common/thick/hsub 
ewtsrnal  resfre 

freqO-freq 

alow-5 .  *hsu'D 
ahigh“1000 , *hsub 
toler«l . Od-lC 

a-zercin (alow, ahigh, resfre, toler) 
end 

double  precision  function  resfre (a) 
implicit  double  precision  (a-h,o-z) 

common/fdesir/freq 

common/releps/epsr 

commor/thick/hsub 

call  re3rec(epsr,a,a,hsub,fre3,q) 
resfre-fres-freq 

end 

subroutine  resrec (er,a,b,t,fres,q) 
implicit  double  precision  (a-h,o-z) 


data  ai00,al01,al02,al03,all0,alll,all2,all3. 
t  al20,al21,al22,a]23,a200,a201,a202,a203. 

(  a210,a2ll,a212,a?13,a220,a221,a222,a223/ 

(  -3. 156'/54397448,  1.795725628752,-0.2652696640599, 

(  1.3023599874332d-2, 

&  0.3030205876506,-0.8132378043906,0.1377825867855, 

t  -7.4426027077389d-3, 

S  -6. 999192346233d-2. 0.2074387779152. -3. 78469754 61942d-2. 
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2.1295393807366d-3, 

14. «7374793072> -9. 117720788346,1.204181550099, 
-5.37275866126e7d-2, 

-3 . 919355911932, 4.283895471007,-0 . 5480065590744, 
2.3856532089789d-2, 

1.2689142466212,-1.322922648767,0.1932147231985, 

-9.20955010l2563d-3/ 

data  a300, a301, a302 , a303, a310, 3311, a312 , a31 3, 
a320 , a321, a322 , a323/ 

-24.64990432442,14.07938052452,-1-241188231639, 

3.0849725458221d-2, 

7. 104107341725, -5. 734060839448, 3. C0209083841d-2, 
2.9052167395763d-2, 

-3.089017025567,2.374351824763,-0-176778025576, 

9.907781784825d-4/ 

data  blOO ,bl01, bl02 , bl03, bllO , bill, bll2, bll3, 
bl20,bl21,bl22,bl23,b200,b201,b202,b203, 
b210,b211,b212,b213,b220,b221,b222,b223/ 

-4 . 6152911d-2, 2 . 7101483d-2,-4 .4631474d-3, 

2 . 1826716d-4, -0 . 7032761, 0 . 2455350, -3 , 200743^d-2, 

1 . 4145372d-3, 0.1795260,-7 . 5435199d-2, 1 . 0676009d-2, 

-4 . 9243571d-4, -0 . 8322784 , -6 . 4469166d-2 , 3 . 5479948d-2, 
-2..2502933d-3, 3.735017,-1. 411132 , 0 .1898225, 

-8 .5380571d-3, -1 . 135043, 0.4898767, -7 . 0118777d-2, 
3.2546222d-3/ 

data  b300,b301,b302,b303,b310,b311,b312,b313, 
b320,b321,b322,b323/ 

2. 135107,-0. 2780477, -1.4273385d-2, 2. 2362657d-3, 
-7.907233, 3. 316067, -0.4600076, 2. 0703688d-2, 

2. 526486, -1.141029. 0.1641063, -7. 5149969d-3/ 


al0-al00+al01*Er+al02*Er**2+al03*Er**3 

all-allO+alll*Er+all2*Er**2+all3*Er**3 

al2-al20+al21*Er+al22*Er**2+al23*Er**3 

a20-a200+a201*Er+a202*Er**2+a203*'Er**3 

a21-a210+a211*Er+a2i2*Er**2+a213*Er**3 

a22-a220+a221*Er+a222*Er**2+a223*Er**3 

a30<-a300+a301*Er+a302*Er**2+a303*Er**3 

a31-a310+a311*Er+a312*Er**2+a313*Er**3 

a32-a320+a321*Er+a322*Er**2+a323*Er**3 

al-alO+all* (a/b) +al2*  (a/b) **2 
a2-a20+a21* (a/b) +a22* (a/b) **2 
a3-a30+a31* (a/b) +a32* (a/b) **2 

Wr-1 .+al* (t/b) +a2* (t/b) **2+a3* (t/b) **3 

bl0-bl00+bl01*Er+bl02*Er**2+bl03*Er**3 

bll-bllO+blll*Er+bll2*Er**2+bll3*Er**3 

bl2-bl20+bl21*Er+bl22*Er**2+bl23*Er**3 

b20-b200+b201*Er+b202*Er**2+b203*Er**3 

b21-b210+b211*F.i:b212*Er**2+b213*Er**3 

t22-b220+b221*er+.->222*Er**2+b223*Er**3 

b3'.'-b300+b301*Er+b302*Er**2+b303*Er**3 

b31-b310+b311*.Xr+b312*Er**2+b313*Er**3 

b32-b320+b321*Er+b322*Er**2+b323*Er**3 

bl-blC^bll* (a/b) +bl2* (a/b) **2 
b2-b20+b21* (a/b) +b22* (a/b) **2 
b3-b30+b31* (a/b) +b32* (a/b) **2 

Wi»bl^ (t/o) +b2* (t/b) **2+b3* (t/b) **3 
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w0-3.d8/ (2.*a)/sqrt (Er) 
£res-Hr*wO 
q— Hr/2./Hi 

write(*,*)  'Reson.  Freq.-',Kr*wO 
end 


subroutine  mspara  (zl,woh,epse£) 
inpllclt  double  precision  (a-h,o-z) 

comroon/desirz/ zO 
external  zcharO 

zO-zl 

wohlow»0.000001 
wohigh~1000 . 
toler-1 . Od-10 

woh-  zeroin (wohlow, wohigh, zcharO, toler ) 
epse£-epse£f (woh) 

write (*,*)  'w/h-',woh, '  epseff-' ,epsef 


end 


double  precision  function  zcharO (woverh) 
itnplicit  double  precision  (a-h,o-z) 
common/desirz/zl 

zcharO-zchar (woverh) -zl 


return 

end 

double  precision  function  zchar (woverh) 
inrolicit  double  precision  (a-h,o-z) 
common/releps/epsr 


pi-3.1415926535 

£l-6.+(2.*pi-6, ) *exp(- (30.666/woverh) **0.7528) 
zhomog-60.*dlog(fl/woverh+sqrt(l.+ (2. /woverh) **2) ) 
zchar-zhomog/sqrt (epseff (woverh) ) 


return 

end 

double  precision  function  epseff (woverh) 
iniplicit  double  precision  (a-h,o-z) 
ccTunon/releps/epsr 


a-1 .+dlog ( (woverh**  4+ (woverh/ 52 . ) **2) / (woverh  **4+0 .432) ) /49 .+ 
&  dlog (1 .+ (woverh/18 . 1) **3) /18 .7 
b»0 .564* ( ( (epsr-0 . 9) / (epsr+3. ) ) **0 . 053) 
epseff- (epsr+1. ) /2 .+(epsr-l . ) /2./ (1. +10. /woverh) ** (a*b) 

return 

end 

To  get  dlmach,  mail  netlib 

QAnH  mmar*V>  ■fr'Am 


15-30 


double  precision  function  zeroin{ax,bx, f,tol} 
double  precision  ax,bx,£,tol 
c 

c  a  zero  of  the  function  f(x)  is  computed  in  the  interval  ax,bx  . 
c 

c  input . . 
c 

c  ax 
c  bx 
c  f 
c 

c  tol 
c 
c 

c  output • . 
c 

c  zeroiit  abscissa  approximating  a  zero  of  f  in  the  interval  ax,bx 
c 

c  it  is  assumed  that  f(ax)  and  f(bx)  have  opposite  signs 

c  this  is  checked,  and  an  error  message  is  printed  if  this  is  not 
c  satisfied.  zeroin  returns  a  zero  x  in  the  given  interval 
c  ax,bx  to  within  a  tolerance  4*m3cheps*abs (x) +tol,  where  macheps  is 
c  the  relative  machine  precision  defined  as  the  smallest  representable 
c  number  such  that  l.+macheps  .gt.  1. 

r  this  function  subprogram  is  a  slightly  modified  translation  of 

<;  the  algol  60  proci  dure  zero  given  in  richard  brent,  algorithms  for 
c  minimization  witho  it  derivatives,  prentice-hall,  inc.  (1973). 
c 

c-iuble  precision  a,b,c,d,e,eps,fa,fb,fo,toll,xm,p,q,r,s 
double  precision  dabs,  dlmach 
10  eps  “•  dlmach (4) 
toll  <•  eps+l.OdO 
c 

a-ax 
b-bx 
fa-f (a) 
fb-f (b) 

c  check  that  f(ax)  and  f(bx)  have  different  signs 
if  (fa  .eq.O.OdO  .or.  £b  .eq.  O.OdO)  go  to  20 
if  (fa  *  (fb/dabs (fb) )  .le.  O.OdO)  go  to  20 
write (6, 2500) 

2500  format (lx, ' f (ax)  and  f (bx)  do  not  have  different  signs,', 

1  '  zeroin  is  aborting' ) 

return 

20  c-a 
fc-fa 
d-b-a 
e-d 

30  if  (dabs (fc) .ge .dabs (fb) )  go  to  40 
a«b 
b-c 
c-a 
fa-fb 
fb-fc 
fc-fa 

40  toil«2 . 0d0*eps*dabs (b) +0 . 5d0*tol 
xm  -  0.5d0* (c-b) 

if  ( (dabs (xm) ,le. toll) .or. (fb. eq.O.OdO) )  go  to  150 
c 

c  see  if  a  bisection  is  forced 
c 

if  ( (dabs (e) .ge. toll) .and. (dabs (fa) .gt. dabs (fb) ) )  go  to  50 
d-xm 

go  to  3 1C 

C  A  *  ._£\.  i  je  m. 


left  endpoint  of  initial  interval 
right  endpoint  of  initial  interval 

function  subprogram  which  evaluates  f (x)  for  any  x  in 
the  interval  ax,  bx 

desired  length  of  the  Interval  of  uncertainty  of  the 
final  result  (.ge.O.) 
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"30000000000000000000000 


if  (a.ne.c)  go  to  60 


c  linear  interpolation 
c 

p*'2.0d0*xni*s 
q-1 . OdO-s 
go  to  70 
c 

c  inverse  quadratic  interpolation 
c 

60  q-fa/fc 
r-fb/fc 

p-s* (2 . OdO*xm*q* (q-r) - (b-a) * (r-l. OdO) ) 
q- (q-1 .OdO) * (r-l .OdO) * (s-1. OdO) 

70  if  (p.le.O.OdO)  go  to  80 

q— q 

go  to  90 

80  p"-p 
90  s«e 
e-d 

if  ( { (2 .OdO*p) .ge. (3.0d0*3an*q-dabs {toll*q) ) ) .or . (p.ge. 
*dabs{0.5d0*s*q) ) )  go  to  100 
d-p/q 
go  to  10 
100  d“3rm 
e-d 
110  a-b 
fa-fb 

if  (dabs (d) .le. toll)  go  to  120 

b-bvd 

go  to  140 

120  if  (xm.le.O.OdO)  go  to  130 
b-b+toll 
go  to  140 
130  b-b-toll 
140  fb-£(b) 

if  ( (fb* (fc/dabs (fc) ) ) .gt.O.OdO)  go  to  20 
go  to  30 
150  zeroin-b 
return 
end 

DOUBLE  PRECISION  FUNCTION  DlMACHd) 

DOUBLE-PRECISION  MACHINE  CONSTANTS 

D1MACH(  1)  -  B**  (EMIN-1),  THE  SMALLEST  POSITIVE  MAGNITUDE. 
D1MACH(  2)  -  B**EMAX*(1  -  B**(-T)),  THE  LARGEST  MAGNITUDE. 
D1MACH(  3)  -  B**(-T),  THE  SMALLEST  RELATIVE  SPACING. 

D1MACH(  4)  -  B**(l-T),  THE  LARGEST  RELATIVE  SPACING. 

D1MACH(  5)  -  LOGIO(B) 

ALTER  THIS  FUNCTION  FOR  A  PARTICULAR  ENVIRONMENT, 

'  HE  DESIRED  SET  OF  DATA  STATEMENTS  SHOULD  BE  ACTIVATED  BY 
<EMOVING  THE  C  FROM  COLUMN  1. 

ON  RARE  MACHINES  A  STATIC  STATEMENT  MAY  NEED  TO  BE  ADDED. 

(BUT  PROBABLY  MORE  SYSTEMS  PROHIBIT  IT  THAN  REQUIRE  IT.) 

FOR  IEEE-ARITHMETIC  MACHINES  (BINARY  STANDARD) ,  ONE  OF  THE  FIRS^ 
TWO  SETS  OF  CONSTANTS  BELOW  SHOULD  BE  APPROPRIATE. 

MHPRF  pnRRTRT.F.  riFTTMAT..  OFTAT.  f3R  UFYAnrf'TMAT  f'AMC'TAMT'C  ADP  iicm 
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USER  ASSISTED  INFORM/\TION  EXTRACTION 


Pradip  Peter  Dey 
Hampton  University 
Computer  Science  Department 
Hampton,  VA  23668 


0 .  ABSTRACT 

User  assisted  information  extraction  from  formated  and 
unformated  messages  requires  a  friendly  user  interface, 
knowledge  based  natural  language  processing,  and 
utilities  for  integrating  future  developments  in  message 
processing  technology.  Parallel  and  distributed 
processing  will  play  an  important  role  in  the  future 
enhancement  of  the  message  processing  technology.  Two 
research  tools  were  developed  in  the  summer  of  1991  that 
allow  one  to  experiment  with  a  prototype  information 
extraction  system  available  in  the  IRDS  branch  of  Rome 
Laboratory.  Some  components  of  the  prototype  system  can 
be  executed  concurrently  by  means  of  these  tools.  In  order 
to  exploit  the  enormous  potential  of  the  system  a  detailed 
study  of  parallel  processing  architectures,  algorithms, 
data  and  control  structures  for  the  system  is  recommended. 
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1.  INTRODUCTION 

This  report  describes  the  research  conducted  between  June  3  and  August  9, 
1991  in  the  IRDS  branch  of  Rome  Laboratory  under  the  AFOSR  program.  The 
research  deals  with  user  assisted  information  extraction.  One  of  the  goals  of 
IRDS  has  been  to  develop  and  test  a  generic  system  in  this  area.  A  prototype 
system  called  Generic  Intelligence  Processor  (GIP)  is  available  in  IRDS  for 
experimental  puiposes.  An  enhanced  version  of  GIP  is  under  development  now. 
In  addition,  further  long-term  enhancements  of  GIP  are  already  being  planned. 
With  GIP,  IRDS  has  undenaken  a  challenging  task  of  putting  an  emerging 
technology  into  work.  GIP  is  proposed  to  be  developed  in  evolutionary  stages 
gradually  adding  new  components  and  enhancing  the  system  as  the  technology 
develops.  GIP  has  enormous  potential  for  technology  transfer  in  some  .'irucial 
areas  as  it  provides  an  environment  for  transferring  both  short  and  long-term 
developments.  This  research  focuses  on  long-term  enhancement  of  GIP  rather 
than  immediate  enhancement  that  is  already  undenaken.  The  specific  long-term 
enhancement  tiiat  is  addressed  here  is  the  parallelization  problem  of  GIP.  GIP 
must  be  studied  for  parallel  and  distributed  execution  for  two  main  rea.'ions:  (a) 
GD?  is  a  complicated  system  requiring  substantial  computing  power,  (b)  Since 
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GIP  would  be  used  in  a  distributed  environment,  resource  and  load  sharing 
among  the  processors  of  the  environment  is  a  reasonable  goal. 

Two  research  tools  for  investigating  parallel  and  distributed  execution  of  GIP 
modules  have  been  developed  during  the  summer  of  1991.  The  tools  are:(l) 
Remoteacquire,  and  (2)  Remotenlp.  Remoteacquire  is  used  for  acquiring  a  large 
number  of  messages  from  a  remote  source.  Remotenlp  is  used  to  process  several 
messages  on  a  remote  site  and  obtain  the  results  back  to  the  host.  These  tools 
are  described  in  section-3.  These  tools  allow  Rome  Laboratory  to  conduct  some 
preliminary  experiments  in  parallel  execution  of  GIP  modules.  A  detailed  study 
of  parallel  and  distributed  execution  of  GIP  has  to  be  conducted  at  a  later  time. 


2.  GENERIC  INTELLIGENCE  PROCESSOR  (GIP) 

A  detailed  description  of  GIP  is  available  in  a  technical  proposal  written  by  the 
Knowledge  Systems  Concepts,  Inc.  (KSC  1991).  The  main  goal  of  GIP  is  to 
extract  information  from  formatted  and  unformatted  messages  and  transmit  the 
results  to  downstream  processes  such  as  expert  systems.  GIP  is  a  toolkit  for 
developing,  testing,  and  delivering  message  processing  applications.  When  fully 
developed  GIP  will  provide  the  following  capabilities  (a)  user  selectable  input 
sources,  (b)  automatic  message  parsing,  (c)  automatic/user  assisted  information 
extraction,  (d)  automatic  ouq)ut  formatting  for  development  systems,  (e)  utilities 
for  developing  new  algorithms.  For  processing  messages  with  substantial  natural 
language  texts  user  assistance  will  be  crucial  for  GIP,  because,  natural  language 
processing  technology  is  not  mature  enough  for  fully  automated  information 
extraction  in  an  operational  environment.  Therefore,  there  is  considerable 
emphasis  on  user  interface  in  the  GIP  design.  The  GIP  program  components  are 
reproduced  in  Figure- 1. 
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Components  (from  KSC  1991 


As  mentioiied  eariicr,  GIP  is  a  complicated  system  lequiiiiig  emamous 
computing  power.  E£Bcieni  execudtm  of  GIP  is  possible  only  by  clever 
utilizaiioo  of  coaqaiting  lesauices  in  the  GIP  enviionmenL  In  <Hder  to  achieve 
this,  parallel  and  distributed  execution  of  GIP  mrdules  would  be  ncccssaiy.  A 
detailed  study  of  parallel  processing  aichitectuies.  algtxithms,  data  and  control 
stnictuies  for  GIP  will  be  lequiied.  This  pieliminaiy  study  provides  two  tools 
that  allow  user  controlled  concunent  execution  of  some  GIP  components  for 
experimental  puiposes. 

3-  TWO  EXPERIMENTAL  TOOLS 

GIP  must  perform  in  a  distributed  environment  where  several  computers  are 
networked  together  for  sharing  re:  ources  and  computation.  Two  tools  have  been 
developed  that  allow  this  sharing  on  an  experimental  basis.  These  tools  are  also 
useful  to  analyze  GlP’s  parallel  processing  requirements  and  design  and 
development  The  tools  are  called  Remoteacquire  and  Remotenlp. 

3.1.  Remoteacquire 

Remoteacquire  is  a  tool  that  acquires  messages  from  remote  sources.  It  uses 
the  rep  protocol  from  Networking  Tools  and  Programs  (Sun  Microsystems)  for 
copying  an  entire  directory  from  a  remote  source.  It  prompts  for  the  name  of  a 
remote  source  from  which  messages  have  to  be  acquired.  Once  the  name  is 
given,  the  messages  are  acquired  over  the  network  which  can  be  processed  by 
GIP’s  other  modules.  Remoteacquire  can  be  concurrently  executed  with  some 
other  components  (such  as  the  extraction  component)  of  GIP. 


32.  Remotenlp 


Remocoilp  is  a  uxd  that  executes  a  aattiral  language  pFOcessw  (NLP)  on  a 
remote  site  and  obtains  the  results  back  in  the  GIF  enviiciinicnL  It  uses  the  rsh 
protocol  firom  Netwtnldng  Tools  and  Programs  (Sun  Microsystems).  While  the 
NLP  is  being  executed  on  die  remote  »te,  GIP*s  other  modules  can  be  executed 
on  the  local  host  This  produces  large  grain  parallelism  at  a  high  level. 
Parallelism  at  another  level  is  simulated  by  Renxitenlp  by  creating  several 
processes  and  executing  a  different  message  in  each  of  the  processes. 


4.  PARALLEL  ARCHTTECTURES  FOR  GIP 

There  are  three  basic  architecture  types:  (1)  SISD  =  single  instruction  single 
data  stream.  Traditional  sequential  computers  are  of  this  type.  A  single 
instruction  is  executed  at  a  time  on  a  single  piece  of  data.  (2)  SIMD  =  single 
instruction  multiple  data  stream.  A  single  instruction  is  executed  in  a  number  of 
processors  on  different  pieces  of  data.  Synchronization  is  automatic  in  this 
architecture.  A  processor  either  executes  the  same  instruction  with  other 
processors  or  remains  idle.  T’.ie  connection  machine  is  an  SIMD  machine  (Hillis 
1985).  (3)  MIMD  =  multiple  instruction  multiple  data  stream.  Different 
instructions  can  be  executed  in  different  processors  on  different  pieces  of  data. 
There  are  two  types  of  MIMD  computers:  (a)  Multicomputers  =  loosely  coupled 
systems.  In  this  architecture  each  processor  has  its  own  local  memory  and 
communications  between  the  processors  are  achieved  through  an  interconnection 
network.  They  are  also  known  as  distributed  systems.  The  hypercube  computers 
are  of  this  type  (Athas  and  Seitz  1988). 
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(b)  MuldpiDcessors  =  tightly  coupled  ^sterns.  In  these  ^sterns,  the  memory 
is  commoo  to  all  piocesstns.  The  processtws  share  a  number  of  global  variables 
stmed  in  the  ccnnmon  memory.  These  variables  enable  the  processors  to 
c(Htunur?lcate  cf^aently  through  the  shared  memory  if  the  number  of  processors 
is  not  too  large.  Two  examples  of  commercially  available  shared  memory 
muldprocessors  are  thf*  BBN  Butterfly  system  (Rettberg  and  Thomas  1986)  arid 
the  Sequent  Balance  system  (Osterhaug  1985).  Algorithms  designed  for 
muldprocessors  are  known  as  asynchronous  parallel  algorithms.  An 
asynchronous  algorithm  is  a  collecdon  of  processes,  some  or  all  of  which  are 
executed  simultaneously  on  a  number  of  available  processors.  When  the 
parallel  algorithm  begins  execudon  on  a  processor,,  it  creates  a  number  of 
processes  to  be  performed.  If  free  processors  are  available,  these  processes  are 
assigned  to  the  processors  to  perform  the  necessary  computadons.  Otherwise,  a 
process  is  queued  and  waits  for  a  processor  to  be  free.  When  a  processor 
completes  execution  of  a  process,  it  becomes  free. 
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Figure  2.  Two  Main  Queues  of  a  Multiprocessor 
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Multipiocessots  veiy  popular  because  tb^  aie  easy  to  program. 
Multiprocessors  woik  with  two  queues:  a  process  queue  and  a  processor  queue. 
The  main  thing  a  programmer  has  to  do  is  create  an  optimal  number  of  processes 
out  of  a  program.  The  processes  are  queued  in  the  process  queue.  Free 
processors  wait  in  the  processor  queue.  If  processes  are  waiting  in  the  process 
queue  then  they  are  assigned  to  available  processors  from  the  processor  queue. 
After  a  process  is  executed,  the  processo*'  becomes  free  and  is  queued  back  to 
the  processor  queue. 

There  is  another  interesting  but  specialized  parallel  architecture  of 
computation  known  as  the  neural  network  in  which  a  large  number  of  processors 
(similar  to  neurons)  are  interconnected  and  processing  is  done  by  spreading 
activation  (McClelland  et  al  1986).  By  passing  values  processors  change  the 
connective  strengths  among  themselves  and  propagate  activations  through  the 
network.  This  propagation  constitutes  processing  by  the  system.  The  neural 
network  is  an  attractive  model  of  parallel  computation  for  building  trainable 
systems  (Howells  1988). 

When  fully  developed  GIF  would  be  a  very  large  and  complex  system  that 
would  require  a  hybrid  architecture  for  efficient  execution  of  its  varied 
components.  For  example,  a  neural  network  architecture  may  be  needed  for  some 
of  the  learning  components  of  GIF;  whereas  a  shared  memory  architecture  could 
be  used  for  the  natural  language  processor  (Dey  and  Hayashi  1989).  Further 
studies  would  be  necessary  to  specify  the  exact  architecture  for  GIF.  However,  at 
this  time,  it  can  be  concluded  that  GIF  would  require  a  hybrid  architecture  with 
some  MIMD  components,  since  GIF  must  allow  asynchronous  parallelism. 
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5.  NATURAL  LANGUAGE  PROCESSING  IN  THE  GIP  ENVIRONMENT 


Natural  Language  Processing  technology  is  matme  enough  to  be  put  in 
certain  real  world  use  with  appropriate  user  assistance.  Information  extraction 
from  messages  is  one  of  the  most  promising  fields  for  this  technology.  User 
assistance  would  be  required  in  the  extraction  process  imtil  some  future  break¬ 
through  makes  totally  automated  natural  language  understanding  possible.  GIP 
requires  robust,  user-oriented,  modifiable,  parallelizable  NLP  modules.  GIP  is 
recommended  to  have  at  least  two  alternative  NLP  modules  so  that  users  would 
be  able  to  compare  the  results  of  the  two  NLP  modules. 

An  experimental  study  of  integrating  an  NLP  system  in  the  GIP  environment 
has  been  conducted  in  the  summer  of  1991.  The  main  goal  of  the  study  has  been 
to  broadly  specify  the  requirements  of  the  NLP  modules  of  GIP.  These 
requirement  are  briefly  described  below: 

(1)  GIP  would  require  well-documented  NLP  modules.  In  order  to  integrate  the 
NLP  modules  documents  like  algorithms,  description  software  design  documents 
and  user’s  manual  would  be  necessary.  Software  development  models  used  for 
the  modules  should  be  briefly  described. 

(2)  The  NLP  component  should  be  properly  modularized.  The  interfaces  between 
modules  should  be  well-defined. 

(3)  The  NLP  modules  should  allow  parallelization  for  efficient  execution. 
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(4)  Scalability  should  be  considered  as  an  essential  feature  of  die  NLP  modules. 
It  is  often  required  to  incoiporate  this  provision  in  the  design  level;  otherwise, 
the  modules  have  to  be  redesigned. 


(5)  For  messages  with  free  text,  GIP  NLP  modules  should  emphasize 
discourse/text  analyses  rather  than  sentential  analyses.  Ftonominal  references 
should  be  resolved  in  examples  like: 

(5.1)  John  has  been  dating  Mary  for  the  past  two  years.  He  is  planning  to 
marry  her. 

(6)  Semandc  interpretation  needs  to  exploit  different  sources  of  Knowledge  such 
as  lexical  knowledge,  syntactic  knowledge  and  domain  knowledge.  Prepositional 
phrases  like  "OF  THE  PRC  AND  THE  SOVIET  UNION"  in  the  following 
fragment  of  text  cause  problems  for  natural  language  systems  that  do  not 
properly  integrate  knowledge  sources. 

(6.1)  LIMA,  25  OCT  89  (EFE)  -  [TEXT]  POLICE  HAVE  REPORTED  THAT 
TERRORISTS  TONIGHT  BOMBED  THE  EMBASSIES  OF  THE  PRC  AND 
THE  SOVIET  UNION. 


(7)  Appropriate  error  handling  capabilities  must  be  prov  Jed  in  the  modules. 
Whenever  necessary,  the  control  should  be  returned  to  the  user. 

(8)  The  NLP  modules  should  be  modifiable  otherwise  GIP  will  not  be  able  to 
deal  with  the  emerging  technology. 

(9)  Most  NLP  systems  provide  a  learning  module  at  the  lexical  level.  Some 
learning  or  adaptive  capabilities  must  be  provided  in  addition  to  the  lexical  level. 
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APPENDIX:  Programs 
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I* 

~  ^pyiremoteacquirex 
Peter  Dey 

This  function  copies  interactively  an  entire  directory  from  another  site 
using  rep  .  It  prompts  for  a  remote  pathname  that  has  to  be  coppied. 
See  also  getgouldx 
See  also  remotenlpx 


*1 

#  include  "stdioJi" 

#  include  "stdlibJi** 


mainO 

{ 

FILE  f; 
int  status; 

prinifC*  INTERACTIVE  MESSAGE  ACQUISITION  FROM  REMOTE  MESSAGE-SOURCE  n"); 
printf("  THE  NOTATION  FOR  ENTERING  MESSAGE-SOURCE  IS:  usemamc@siie:directory  n  n") 
printf("  ENTER  THE  MESSAGE-SOURCE:  "); 
scanfC'%s".  &f): 
if  (forkO  =  0) 

execl("/usr/ucb/rcp".  ’‘rep",  "-r".  f.  "Demo",  NULL); 

wail(&status); 

if  (forkO  =  0) 

pruuf( "Message  Acquisition  CompletcO); 
waii(&status); 

) 
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/* 

~gip/remotenlp . c 
Peter  Dey 

This  program  sends  messages  to  another  site  using  rep. 

Then,  it  processes  them  on  that  remote  site  using  remote  procedure 
calls.  Finally  it  transmits  the  results  back  to  the  this  site. 

See  also  ~gip/remoteacquire.c 

*/ 


#  include  "stdio.h" 

#  include  "stdlib.h" 


main ( )  { 

int  status; 
if  (forkO  0) 


execl ("/usr/ueb/rep". 

"rep". 

"Demo/testdir2/msl", 

"gip@buckwheat :msl". 

NULL) 

wait (& status) ; 
if  (forkO  -■  0) 
execl ("/usr/ueb/rep". 

"rep". 

"Demo/testdir2/ms2", 

"gip@buckwheat :ms2". 

NULL) 

wait (& status) ; 
if  (forkO  ”  0) 
execl ("/usr/ueb/rep". 

"rep". 

"Demo/testdir2/ms3", 

"gip@buckwheat :ms3". 

NULL) 

wait (fistatus) ; 
if  (forkO  ”  0) 
execl ("/usr/ueb/rsh". 

"rsh". 

"buckwheat",  "muc3test 

",  "msl". 

NULL) ; 

wait (Sstatus) ; 
if  (forkO  «  0) 
execl ("/usr/ueb/rsh". 

"rsh". 

"buckwheat",  "muc3test 

",  "ms2". 

NULL)  ; 

wait (Sstatus) ; 
if  (forkO  —  0) 
execl ("/usr/ueb/rsh", 

"rsh". 

"buckwheat",  "muc3test 

",  "ms3". 

NULL)  ; 

wait {& status) ; 
if  (forkO  —  0) 

execl ("/usr/ueb/rep”,  "rep",  "gip@buckwheat:msl.fill",  "msl.fill",  NULL); 

wait (Sstatus) ; 
if  (forkO  0) 

execl ("/usr/ueb/rep",  "rep",  "gip@buckwheat :ms2 .fill",  "ms2.fill",  NULL); 
wait (Sstatus) ; 
if  (forkO  0) 

execl ("/usr/ueb/rep",  '‘rep",  "gip@buckwheat:ms3.fill",  "ms3.fill",  NULL); 

wait (fistatus) ; 

execl  ("/usr/ucb/cat",  "cat",  "msl.fill",  "ms2.fill",  "ms3.i.ill",  NULL); 

} 
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/* 

/hane/gip/denol  -  c 
Peter  Dey 

This  program  uses  suntools  to  sake  an  on-line  slide.  The  slide  can  be 
used  in  the  gip  environment  during  a  gip  demo.  Coo^ile  the  file  by: 

cc  -o  demol  demol.c  -Isuntcol  -Isunwindow  -Ipixrect 
After  con^ilation  the  object-code  would  be  in  demol.  Enter  the 
gip  demo  environment  and  then  execute  demol  via  a  shell  tool. 

*/ 


finclude  <stdio.h> 
iinclude  <siintooi/sunview  h> 
finclude  <suntool/panel . h> 
finclude  <suntool/icon.h> 

mainC  ) 

{ 

Frame  frame; 

Panel  panel; 

Pixfont  *bold; 

bold  =»  pf_open("/usr/lib/fonts/fixedwidthfonts/cour.b-24")  ; 
if (bold  —  NOLL)  exit(l); 

frame  *  window_create  (NOLL,  FRAME,  FRAME_LA3EL,  ”  GIP",  0) 

panel  =  window_create (frame,  PANEL,  WIN_FONT,  bold,  0); 
panel_create_item (panel,  PANEL_MESSA{S, 

PANEL_LABEL_STRING,  "  GENERIC  INTELLIGENCE  PROCESSOR  (GIP)", 

0); 

window_f  it  (pa'^el) ; 
window_main_loop (frame) ; 

} 


IS  naiDB  a  ODi^ine  sidcL  Tbe  sSde  cat  dc 


wed aite  ^  CBfvinaaM dBrioc  a deaaiL  Compile  die  fik  ty: 
cc  -o  -Inaannl  AsmmimSam  Aetata 

Afitr  cnaqaltfiiHi  iha  oiliK^oode  acakl  be  ia  <kaD2.  Enter  staaosis 
or  dk  ^  doBS  cannaaeat  aad  dm  euaae  ikaatl 


nite*baUt 

boM = ff  apcirAadHdlwiiArMd»i*MWBa^^ 

ir(baU  =  KUU)caa(lk 

fiMM-  -ira<ir-  nr-Tf""*  csttic  t  AWtn  ' 

iiml  ii[iliiii~iiMr(fmi-."*""  RKfT.boU.O); 
laad  aem.  PANELJIESSAGC 

?ANELLABELSnUNa  ' 

"  ■  Ok 

paKi  atm.  jaaKpaMl.  PANEL  MESSAGE, 

FaNEL  LABEL  STOING."  WHAT  IS  TOE  GENERIC  INTELLIGEXCE  PROCESSOR  (GIP)  7 
"  ■  Ok 

paaljcicae  iieni(paai.  PANEL  MESSAGE. 

PANEL  LABEL  SIRING.  "  '  '  > 

Ok 

paid  aeae  aon^aad.  PANEL  MESSAGE, 

PANEL  LABEL  SIRING.  ‘ 

Ok 

paid  ocae  icmfpmtd,  PANEL  MESSAGE, 

PANEL  LABEL  SIRINC.  *  =>  A  TOOLKIT  FOR  DEVELOPING.  TESTING  AND  DEUVERING 

-  -  Ok 

paid  cteae  aeoKjpaid.  PANEL  MESSAGE. 

FaNEL  LABEL  SIRING."  MESSAGE  PROCESSING  APPUCATIONS 

0): 

P«nri  cieae  aemlpaid,  PANEL  MESSAGE. 

FaNEL  LABEL  SIRING," 

Ok 

paid  atmc  iianQiaid.  PANEL  MESSAGE 
PANEL  LABEL  STRING.  * 

Ok 

paid  aeae  iian(paid.  PANEL  MESSAGE 
FaNEL  LABEL  SIRING. " 

Ok 

paid  aeae  iieni(paML  PANEL  MESSAGE 

FaNEL  label  SIRING. "  IT  CONSISTS  OF:  ", 

0); 

paid  aeae  iiem^paid.  PANEL  MESSAGE 

PANEL  LABEL  SIRING. "  -  -  > 

Ok 

paid  aeae  iient^und,  PANEL  MESSAGE 

PANEL  LABEL  STRING. "  '.  0); 


GIP-.Ok 


paid  aeae  iian(paKL  PANEL  MESSAGE 

PANEL  LABEL  STRING."  =>  USER  SELECTABLE  INPUT  SOLRCES  ", 

Ok 

paid  aeae  iiemlpaid.  PANEL  MESSAGE 

PANEIT LABEL  STRING,"  =>  AUTOMATICVUSER  ASSISTED  INFORMATION  EXTRACTION  ", 

0); 

paid  aeae  iionlpaiel,  PANEL  MESSAGE 

PANEL  UBEL  STRING. "  =>  AUTOMATIC  OUTPUT  FORMATTING  FOR  DEVELOPMENT  SYSTEMS 

0); 

paid  aeae  uem(paiel,  PANEL  MESSAGE 

PANEL"  UBEL  STRING. "  =>  unUTIES  FOR  DEVELOPINCyTESTING  NEW  PROCESSING  ALGORmiMS". 

0): 

oaid  aeae  iiem(paiel,  PANEL  MESSAGE 

panel"  UBEL_STRING."  ’.0); 

window_fil(paidk 
window  man  loop(frane); 

! 
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gip 


/ *  /usr /etc/helpgip . c 

Peter  Dey 

This  program  prints  out  hints  for  running 


#  include  "stdio.h" 

#  include  "stdlib.h" 


mainO 

0 

\ 

printf("l.  Login:  gip  \n"); 
printf("2.  Wait  &  Click  as  directed  \n"); 

printf("  (Execute  demol  from  shell  tools,  optionally)  ■  \n") ; 
printf{"  (Execute  remoteacquire  on  CONSOLE,  optionally)  \n") ; 
printf("3.  Click  on  PROJECT  -  Open  &  Accept  \n"); 
printf("4.  Click  on  DEVELOPER  -  Open  &  Accept  \n”); 
printf("5.  Click  on  DEVELOPER  -  Layout  \n”); 

printf("6.  Click  on  ACQUIRE  -  (Change  Fact  Base  Name,  optionally)  \n") 
printf("7.  Click  on  CONTROL  -  Run  \n''); 

printf("  (Execute  remotenlp  on  CONSOLE,  optionally)  \n") ; 
printfC'S.  TO  QUIT:  Click  on  LADIES  -  Quit  \n''); 
printfC  Click  on  Gray  -  exit  \n'') ; 

} 
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iVIILUMETER-WAVE  NOISE  MODELING  INVESTIGATION 


ABSTRACT 

A  new  method  is  described  that  provides  valid  mm-wave  noise  models  for 
field  effect  transistors,  including  MESFETs  and  HEh^s.  This  method  avoids  the 
complications  of  a  variable  impedance  tuner,  and  requires  only  the  knovnedge 
of  a  small  signal  equivalent  circuit,  and  noise  figures  measured  across  a  range 
of  frequencies  for  a  single  known  source  impedance.  Noise  parameters 
derived  from  this  method  are  shown  to  agree  well  with  those  obtained  from 
tuner  based  measurements.  A  review  of  previously  reported  noise  modeling 
techniques,  summarized  here,  reveals  either  their  common  dependence  on 
tuner  based  measurements  or  the  use  of  approximations  that  are  not  valid  at 
mm-wave  frequencies. 
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MILLIMETER-WAVE  NOISE  MODELING  INVESTIGATiON 


L  INTRODUCTION 

it  is  well  known  that  the  use  ot  a  low  noise  amplifier  (LNA)  at  the  front  end  of  a 
mm-wave  radar  or  communications  receiver  can  improve  system  sensitivity  and 
dynamic  range  as  compared  tc  the  use  of  down  conversion  with  a  mixer 
immediately  following  the  antenna.  LNAs  are  now  possible  at  frequencies  as 
high  as  W-band  (75-100GHz),  using  state-of-the-art  High  Electron  Mobility 
Transistors  (HEMTs).  However,  accurate  transistor  noise  characterization  is 
critical  to  achieving  good  millimeter-wave  (mm-wave)  LNA  performance,  while 
avoiding  the  widely  used  process  of  hand  tuning.  Such  hand  tuning  for  noise 
performance,  costly  and  time  consuming  for  hybrid  amplifiers,  is  simply  not 
practical  for  millimeter-wave  monolithic  microwave  integrated  circuits  (MMICs). 


As  illustrated  in  Figure  1 ,  FET  noise  characterization  can  take  the  form  of 
measured  noise  parameters  ("Black  Box  Methods")  or  a  noise  model  ("FET 
Specific  Methods"),  from  which  noise  parameters  can  be  calculated.  A 
knowledge  of  four  noise  parameters  allows  for  a  calculation  of  the  device’s 
noise  figure  for  a  given  complex  input  terminating  admittance,  according  to  [7]: 


(1) 


where  F  is  the  noise  figure  of  the  two-port  device,  Ys  is  the  input  or  source 
admittance  presented  to  the  device,  and  Gopt,  Bopt,  Fmin.  and  Rn,  are  the  four 
noise  parameters  defined  as  follows: 
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Y  ““  G  +  jB 

op*  ~  opt  ^  of<  is  the  optimum  input  admittance^ 

Finn  is  the  minimum  noise  figure  achieved  for  Ys  =  Yopt 

Rn  is  called  the  equivalent  noise  resistance,  and  describes  how  rapidly  the 

noise  figure  degrades  as  the  source  admittance  moves  away  from  Yopt . 

Noise  parameter  measurements  are  relatively  routine  at  microwave 
frequencies,  using  conventional  techniques  [1-4].  However,  progress  has  been 
slow  towards  the  implementation  of  mm-wave  noise  parameter  measurements. 
This  is  mainly  due  to  the  difficulty  with  the  fabrication  and  characterization  of 
variable  impedance  tuners  at  mm-wave  frequencies  as  discussed  in  Section  II. 

The  present  research  has  explored  FET  specific  noise  model  methods  (Figure 
1)  with  the  goal  of  providing  an  alternative  or  complementary  mm-wave  FET 
noise  characterization  method  to  those  relying  on  tuner  based  measurements. 
Accordingly,  an  overview  of  FET  noise  modeling  methods  is  given  in  Section  ill. 
Also  described  (Section  IV)  is  a  new  method  developed  for  deriving  valid  mm- 
wave  FET  noise  models  through  the  use  of  an  equivalent  circuit  and  a  set  of 
noise  figure  measurements  over  frequency.  Figure  2  shows  the  model  topology 
used. 


II.  OBJECTIVES  AND  ACCOMPLISHMENTS 

At  the  start  of  the  summer's  work,  a  short  "Objectives  and  Summary  Report 
containing  the  following  information  was  submitted  to  our  Air  Force  sponsors. 
This  provided  the  template  for  the  Summer's  work. 
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PROBLEM:  Conventional  microwave  noise  modeling  methods  utilize  a 
variable  impedance  (or  admittance)  tuner,  which  presents  a  sequence  of 
"known"  source  impedances  Zs  (admittances  Ys)  to  a  device's  input  terminals. 
The  tuner  must  be  able  to  present  a  range  of  impedances  that  represent  a  good 
coverage  of  the  Smith  Chart,  including  near  the  edges  of  the  chart.  Hence,  the 
tuner  must  have  very  low  loss.  The  ability  to  fabricate,  and  characterize,  such  a 
tuner  becomes  increasingly  more  difficult  as  frequency  is  increased.  Research 
and  commercial  developments  are  being  pursued  in  the  industry  and  at  the  Air 
Forced  implementing  mm-wave  tuners  for  noise  parameter  measurements. 
Because  of  the  inherent  difficulties  with  this  approach,  however,  it  is  essential 
that  alternatives  be  explored  to  the  reliance  on  a  variable  impedance  tuner  for 
mm-wave  FET  noise  characterization. 

SCOPE  OF  RESEARCH:  This  research  consisted  of  a  ten  week  effort 
involving  the  work  of  two  researchers.  The  work  involved  a  paper  study, 
analytical  research,  and  numerical  experiments.  The  numerical  experiments 
combine  analytical  research  with  existing  measurement  data  in  an  attempt  to 
explore  the  validity  of  a  new  noise  modeling  approach  (see  Section  IV ).  Some 
of  the  numerical  experiments  were  carried  out  by  modifying  existing  computer 
programs  generated  by  a  previous  Summer  Research  Associate.  A  new 
computer  program  was  also  generated  as  part  of  this  work.  Because  of  the  time 
constraints,  no  attempt  was  made  to  perform  measurements  for  use  with  this 
study,  instead  existing  data  was  made  available  to  us  from  the  industry. 

SPECIFIC  OBJECTIVES  VS.  ACCOMPLISHMENTS:  The  work 
described  above  was  carried  out  as  planned  and  all  of  the  objectives  of  the 
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work  were  satisfied.  This  degree  of  accomplishment  was  aided  by  extensive 
interaction  with,  Mr.  Rick  Webster.  This  included  short  weekly  reports  and 
weekly  meetings. 

The  research  was  divided  into  several  tasks  and  accomplished  as  follows: 

1 )  OBJECTIVE  -  Review  and  assess  the  previous  MESFET  noise  modeling 
work  of  Mr.  Bill  Patience  [5].  as  performed  under  the  1989  AF  Summer 
Research  Program. 

ACCOMPLISHMENT  -  After  verifying  the  theoretical  derivations  of  this  work,  an 
assessment  was  made  with  regard  to  the  theoretical  assumptions  made, 
including  that  of  frequency  independence  for  noise  source  coefficients  (P.R, 
and  C)  used  with  the  model  of  Figure  2.  This  assessment  was  made  from 
theoretical  considerations,  and  numerical  case  studies  derived  from 
measurement  data  (data  which  was  not  available  to  Mr.  Patience  at  the  time  of 
his  study).  See  Section  IV^. 

2)  OBJECTIVE  -  Explore  improvements  and  alternatives  to  the  use  of  tuner 
based  measurements  for  noise  characterization. 

ACCOMPLISHMENT  -  A  paper  study  was  conducted  to  formulate  an  overview 
of  various  noise  modeli-  lods  documented  in  the  literature  (Section  III). 
This  study  proposes  a  modification  to  the  work  of  Patience  as  the  most 
promising  mm-wave  noise  modeling  method,  that  does  not  require  tuner  based 
measurements.  A  related  new  computer  program  was  generated. 
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3)  OBJECTIVE  -  Prepare  a  final  report  in  accordance  with  AFOSR  Summer 
Research  program  guidelines,  and  a  "mini-grant"  proposal  for  a  follow-on 
research  project. 

ACCOMPLISHMENT  -  The  present  document  satisfies  the  final  report 
requirements,  and  a  related  oral  presentation  was  given  at  Hanscom  AFB  July 
12, 1 991 .  Because  this  work  is  of  interest  to  other  Air  Force  Laboratories, 
presentations  of  the  work  are  scheduled  to  be  given  at  Griffiss  AFB  July  22, 

1991,  and  at  Wright  Patterson  AFB  August  15, 1991.  The  completion  of  the 
mini-grant  proposal  is  forthcoming. 

4)  OBJECTIVE  -  Prepare  a  draft  for  summary^  version  of  a  research  oaoer 
aimed  at  publication  in  a  trade  journal  (e.g.  IEEE  Trans,  on  Microwave  Theory, 
and  Tech.),  a  trade  magazine  (e.g.  Microwave  Journal),  or  a  conference  digest 
(e.g.  IEEE  MTT’S  Symposium  Digest). 

ACCOMPLISHMENT  -  An  abstract  for  a  paper  entitled  "A  New  Millimeter-wave 
Noise  Modeling  Method  for  Field  Effect  Transistors,"  was  submitted  to  the  1 991 
IEEE  International  Electron  Devices  Meeting  (lEDM).  If  accepted,  an  expanded 
four  page  paper  will  be  required  for  the  conference  digest.  It  is  also  expected 
that  a  second  paper  will  be  generated  that  gives  a  comprehensive  review  of 
FET  noise  modeling  methods. 

III.  A  BRIEF  OVERVIEW  OF  FET  NOISE  MODELING  METHODS 
A  viable  approach  for  mm-wave  noise  characterization  of  FETs  is  to  use 
knowledge  of  an  equivalent  circuit  for  the  FET  to  reduce  the  required  noise 
measurement  complexity.  Several  such  FET  specific  methods  have  been 
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proposed  in  the  literature  (Figure  1).  A  brief  summary  of  these  methods  is 
given  here. 

The  reported  FET  noise  modeling  methods  [7-30]  use  various  approximations, 
but  follow  from  a  common  theoretical  formulation.  This  theoretical  formulation 
is  based  upon  the  fundamental  linear  two  port  noise  theory  developed  by 
Rothe  and  Dahike  [6]  and  further  developed  by  an  IRE  Subcommittee  on  Noise 
[7].  This  two-port  noise  theory  applies  to  both  Black  Box  noise  parameter 
measurements  and  FET  Specific  Noise  Models  as  indicated  in  Figure  1. 

EARLY  FET  NOISE  THEORY:  The  early  groundwork  for  FET  noise 
modeling  was  laid  by  van  der  Ziel  [8-10] .  van  der  Ziel  [13],  identified  the  main 
source  of  noise  in  FETs  as  thermal  noise  in  the  channel.  He  derived  useful 
analytical  expressions  for  the  intrinsic  gate  and  drain  current  sources  (ing  and 
ind)  and  the  correlation  coefficient  (C)  between  them.  Under  his 
approximations,  C  was  found  to  be  imaginary.  Klaassen,  in  a  slightly  more 
general  formulation  [11],  found  C  to  be  complex  in  general  with  a  small  real 
part,  which  he  attributed  to  high  frequency  gate-channel  coupling  (neglected  by 
van  der  Ziel).  Nonetheless  van  der  Ziel’s  work,  complete  with  imaginary  C,  has 
provided  the  format  for  nearly  every  noise  modeling  solution  to  follow. 

IMPORTANT  NOISE  THEORY  EXTENSIONS:  Important  extensions  to 
this  early  work  were  made  by  Baechtold  [12-13],  Baechtold  advanced  a 
complete  noise  modeling  procedure,  that  relies  on  frequency  independent 
noise  coefficients  P,  R,  and  C  that  follow  directly  from  van  der  Ziel's  relations  [8- 
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9].  The  intrinsic  noise  sources  are  expressed  in  terms  of  P,R,  and  C  through 
the  following  relations: 

t  =4kTA1g„P  j  4kTAfC^0)2 

^ - 9^" 

This  same  noise  source  description  is  used  in  the  present  research  (Figure  2). 
in  Baechtoid's  method,  noise  parameters  are  calculated  using  a  small  signal 
model,  fitted  to  measured  S-parameters  combined  with  analytically  determined 
noise  coefficients  P,R  and  C. 

Another  very  important  contribution  was  made  by  Pucel  et.  al.  [14].  This  wori , 
covers  all  aspects  of  MESFET  operation,  including  DC,  small  signal,  and  noise 
characteristics.  Pucel  et.  al.  include  a  review  is  given  of  prior  work  in  each  of 
these  areas  and  point  out  that  the  previous  FET  noise  theory  (including  that  of 
van  der  Ziel  and  Baechtold)  fails  to  adequately  account  for  velocity  saturation 
effects.  The  authors  proceed  to  use  a  two  section  velocity  field  model,  valid  in 
the  saturation  region  to  derive  expressions  for  P,R,  and  C  and  for  the  four  noise 
parameters.  This  method,  like  that  of  Baechtoid’s,  is  not  directly  applicable  for 
accurate  millimeter-wave  noise  characterization  due  to  the  omission  of  parasitic 
elements  Cdg  and  Rds  (see  Figuit-;  2)  from  the  analysis.  In  addition,  this  model 
shares  the  limitations  of  other  purely  analytic/numerical  models  discussed 
below. 
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The  remaining  FET  noise  model  work  (Figure  1)  has  been  placed  into  three 
categories:  1)  Analytical/Numerical  Noise  Models,  2)  Semi-empirical  Noise 
Models,  and  3)  Physically  Pased  Theoretical/Experimental  Models. 

ANALYTICAL  /NUMERICAL  NOISE  MODELS:  The  first  category  (Figure 
1)  includes  the  work  of  van  der  Ziel,  Baechtold,  and  Pucel  et.  al.,  as  well  as  the 
subsequent  work  of  Gappy  et.  al.  [15-16],  Heinrich  [17-18],  and  Brookes  [19]. 
These  methods  apply  basic  physics  to  calculate  noise  properties,  using  as 
inputs  the  geometry  and  DC  operating  conditions  of  the  FET.  Such  models  are 
very  useful  for  establishing  physically  meaningful  model  topologies  and 
studying  trends  in  electrical  behavior  with  changes  in  geometry  and  operating 
conditions.  However,  analytic/numerical  methods,  by  themselves,  do  not 
produce  the  most  accurate  FET  models  for  circuit  design.  This  is  due  to  the 
many  non-ideal  conditions  existing  in  practical  FETs  including,  for  example, 
variations  in  the  geometrical  dimensions  and  doping  levels.  This  information 
is  also  seldom  available  to  the  circuit  designer.  Hence,  a  practically  useful  FET 
noise  model  requires  that  at  least  some  of  the  noise  model  parameters  be 
determined  through  the  use  of  measured  data. 

SEMI-EMPIRICAL  NOISE  MODELS:  The  second  category  of  noise 
models  (Figure  2)  uses  measured  data  to  determine  empirical  noise  fitting 
factors.  These  fitting  factors  are  used  along  with  a  simplified  equivalent  circuit 
to  estimate  noise  parameters.  Most  notable  of  these  are  the  models  proposed 
by  Fukui  [20-22],  and  Podell[23].  These  models  are  useful  in  that  they  provide 
convenient  closed  form  relations  between  dominant  geometrical  or  equivalent 
circuit  parameters  and  noise  performance,  yet  because  of  the  many 
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approximations  made  they  are  not  weii  suited  for  mm-wave  FET  noise 
characteiization. 

PHYSICALLY  BASED  THEORETICAL/EXPERIMENTAL  MODELS: 
The  third  category  (Rgure  1)  contains  .  most  promising  methods  for  mm- 
wave  applications.  This  work  indudes  the  methods  of  Gupta  et.  ai.  [24-25], 
Pospieszalski  et.  al.  [26-27],  Robertson  ei  al.  [28-30],  and  Riddle  [31].  All  of 
these  methods  assume  (or  equivalently  assume)  that  the  coeffidents  P.R.  and  C 
are  independent  of  frequency  and  rely  on  the  use  of  measured  S-parameters  to 
derive  a  small  signal  equivalent  circuit.  The  methods  differ  in  the 
charaderization  of  the  noise  sources  within  the  model,  in  the  simplifying 
assumptions  and  the  exad  model  topologies. 

Gupta  et.  ai.  [24-25]  use  a  simplified  model  that  only  requires  the  output  noise 
power  spectral  density  that  is  determined  through  an  output  noise  power 
measurement  made  at  a  single  low  frequency  (e.g.  1-2GHz).  Pospieszalski  et. 
al.  [26]  use  a  more  general  model  that  requires  a  set  of  noise  parameters  at  a 
single  frequency  to  characterize  equivalent  temperatures  Tg  and  Td  used  to 
represent  the  gate  and  drain  noise  sources  respectively.  Robertson  et.  al.  [28- 
30]  also  use  a  fairly  general  model,  similar  to  that  of  this  research  (Figure  2). 
They  determine  the  noise  coefficients  P.R,  and  C  by  fitting  an  equation  for  Fmin, 
given  in  terms  of  P,R,  and  C  to  a  set  of  measured  Fmin  values  determined  for  a 
number  of  frequencies  (>=4).  Most  recently,  Riddle  [31]  presented  a  means  to 
directly  extract  P,  R,  and  C  values  for  a  model  similar  to  Figure  1  by  using 
sequential  matrix  manipulations  of  the  measured  single  frequency  noise 
parameters  and  S-parameters. 
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Of  these  "Physically  Based  Theoretical/Experimental  Noise  Models,"  methods, 
the  most  extensively  implemented  and  verified  is  that  of  Pospieszalski's  [27], 
and  this  method  has  been  applied  to  develop  noise  models  for  use  at 
cryogenic  temperatures  [26].  Nonetheless,  while  each  of  the  methods 
discussed  have  merit,  due  to  various  approximations  they  are  generally  not 
valid  at  millimeter-wave  frequencies,  or  if  valid  require  tuner  based 
measurements  in  order  to  characterize  the  internal  noise  sources  of  the  model. 

tv.  A  NEW  MILLIMETER-WAVE  NOISE  MODELING  METHOD 

The  method  advanced  as  part  of  the  present  research,  overcomes  the 
difficulties  of  the  above  methods.  The  new  method  makes  use  of  noise  figure 
data  measured  over  a  range  of  frequencies  along  with  the  equivalent  circuit 
parameters  to  determine  three  frequency  independent  noise  coefficients  (P.R, 
and  C).  This  technique  avoids  unnecessary  simplifications  to  the  equivalent 
circuit  model  and  remains  valid  at  millimeter-wave  frequencies. 

METHOD  OF  PATIENCE:  The  new  method  builds  on  the  method  described 

by  W.  Patience  [5],  which  is  illustrated  in  the  algorithm  of  Figure  3.  The  method 

requires  a  small  signal  model,  derived  from  S-parameter  measurements,  and 

noise  figure  data  taken  over  a  range  of  frequencies  for  a  single  known  source 

reflection  coefficient  condition  Fs,  also  measured  over  frequency.  Central  to  the 

algorithm  is  an  equation  for  the  noise  figure  expressible  in  the  following  form: 

F  =  M,  +  M,  R  +  M,  P  +  M,  Cf^ 

12  3  4  (Oj 

where  Mj,  i=  1,2,3,  4,  are  functions  of  the  known  equivalent  circuit  parameters, 
and  measured  values  of  Fs,  corresponding  to  each  measured  value  of  noise 
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figure  supplied.  The  unknowns,  P,  R,  and  C,  are  determined  using  a  least 
squares  fitting  algorithm  [2],  equation  (3),  and  the  measured  data.  Once  they 
have  been  determined,  a  synthesized  tuner  based  "measurement"  procedure 
computes  the  noise  parameters  at  any  desired  frequency.  The  noise  figure  and 
reflection  coefficient  data  required  by  this  algorithm  can  readily  be  obtained 
from  on-wafer  or  fixtured  measurements  even  at  millimeter-wave  frequencies. 
An  example  of  an  applicable  mm-wave  measurement  system  and  procedure  is 
described  by  Dunleavy  [32]. 

EXAMINATION  OF  THE  ALGORITHM:  The  present  research  included  a 
thorough  investigation  of  the  method  of  Figure  3,  through  both  theoretical 
verification  and  numerical  experiments  that  utilize  measured  data  to  investigate 
and  verify  the  method.  From  a  theoretical  point  of  view,  the  derivation  of 
Patience  was  found  to  be  sound,  except  for  one  assumption  that  the  term  Zcor 
(see  Roth  and  Dahike  [6])  was  assumed  to  be  equal  to  the  network  parameter 
Zii.  This  assumption  is  believed  to  be  an  unnecessary  constraint. 

Next,  the  Patience  method  was  investigated  on  the  basis  of  a  set  of  numerical 
experiments  made  using  measured  data  provided  from  industry  for  a  MESFET. 
The  measured  data,  the  output  from  a  commercially  available  noise  parameter 
measurement  system,  consisted  of  measured  noise  figures  with  corresponding 
values  of  Fs  at  12  frequencies,  for  16  different  source  states  per  frequency. 
The  algorithm  (Figure  3)  was  then  applied  in  two  different  ways  to  generate  the 
data  shown  in  Figure  4. 
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Figure  4a  shows  the  results  by  applying  the  method  using,  for  each  source 
state,  the  noise  figure  data  and  Fs  values  provided  versus  frequency.  Note 
that  for  perfect  data  all  of  these  curves  would  be  flat.  The  fact  that  they  are  not  is 
due  in  part  to  errors  in  the  measurement  data.  For  example,  the  data 
corresponding  to  source  state  2,  9,  and  16  of  Figure  4a  appears  questionable 
since  C  becomes  greater  than  unity"*.  This  illustrates  how  the  method  may  be 
used  to  complement  a  noise  parameter  measurement  procedure,  by  identifying 
problem  source  states.  Some  deviations  may  also  be  due  to  errors  in  the  small 
signal  model. 

In  Figure  4b  are  shown  the  results  of  applying  a  slightly  modified  algorithm  at 
each  frequency  that  uses  the  measured  noise  figures  and  Fs  values  provided 
as  a  function  of  source  state.  With  accurate  data,  the  experiment  of  Figure  4b 
can  be  used  to  investigate  the  frequency  independence  assumption  for  P,R, 
and  C.  Although  most  reported  noise  modeling  methods  have  required  or 
assumed  P,  R,  and  C  to  be  frequency  independent.  Some  numerical  work  has 
suggested  that  they  may  be  frequency  dependent  in  general  [15-1 8].  This  is  a 
subject  of  continuing  controversy^.^,  and  more  data  needs  to  be  examined 
before  reaching  any  definitive  conclusions. 

Despite  the  variations  in  the  P,  R,  and  C  values  observed  in  Figure  4,  the  results 
for  noise  parameters  determined  from  the  algorithm  of  Figure  3  agree  very  well 
with  measured  noise  parameter  measurements  as  shown  in  Figure  5.  This 
shows  considerable  promise  for  the  method's  application  to  develop  mm-wave 
noise  models  with  the  improvements  discussed  next. 
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METHOD  IMPROVEMENTS  AND  EXTENSIONS:  Based  on  this  research 
several  improvements  to  the  method  have  been  initiated.  These  include  the 
derivation  of  a  more  general  equation  for  the  noise  figure  that  does  not  assume 
Zcor  =  Zii.  The  results  of  this  derivation  have  now  been  implemented  into  the 
new  method  and  verified  with  related  software^.  Other  improvements  initiated 
are  the  replacement  of  the  synthesized  "measurement"  step  from  the  algorithm 
of  Figure  3  with  a  direct  calculation  of  noise  parameters,  and  the  ability  to  solve 
for  the  noise  source  coefficients  in  different  ways  depending  on  the  available 
measurement  data.  These  ways  include  the  use  of  multiple  source  state  data 
to  calculate  P,  R,  and  C  (as  implemented  in  Figure  4b),  the  use  of  Fmin  data 
versus  frequency  (similar  to  Robertson  et.  al.  [28-29]),  and  the  use  of  measured 
noise  parameters  at  a  single  frequency  or  at  multiple  frequencies.  This  last  set 
of  data  is  the  output  of  a  commercial  noise  parameter  measurement  system.  It 
follows  that  a  physically  based  model,  such  as  the  present  one,  fitted  to  the 
entire  set  of  available  data  may  provide  the  most  accurate  frequency 
extrapolatible  noise  model. 

Other  issues  to  be  considered  in  future  research  are  the  implementation  of  an 
improved  small  signal  model  extraction  algorithm,  and  the  application  of  the 
method  to  temperature  dependent  FET  noise  model  development.  The 
derivation  of  a  small  signal  model,  as  done  for  the  present  work,  from  a  single 
set  ol  S-parameters  at  the  operating  bias,  may  not  adequately  separate  the 
extrinsic  and  intrinsic  model  parameters.  To  address  this  problem,  several 
methods  have  been  proposed  to  determine  a  more  accurate,  physically 
meaningful,  small  signal  FET  model  (33-34).  The  University  of  South  Florida  is 
also  investigating  this  problem  as  part  of  a  separate  project.  In  future 
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developments  of  this  noise  modeling  method  an  improved  small  signal  model 
procedure  should  be  employed.  Another  area  of  strong  interest  to  both  the 
industry  and  the  Air  Force  is  the  development  of  temperature  dependent  noise 
models  for  use  in  CAD  and  for  use  in  predicting  and  explaining  device  noise 
behavior  at  temperatures  from  cryogenic  (e.g.  as  low  as  4K)  to  elevated 
temperatures  (e.g.  475K).  Consideration  should,  therefore,  also  be  given  to 
temperature  dependent  noise  model  development  in  future  research. 

V.  SUMMARY  AND  CONCLUSIONS 

This  research  project  has  been  very  successful  in  furthering  the  understanding 
of  mm-wave  noise  modeling  methodology.  All  of  the  research  goals  have  been 
satisfied. 

After  a  detailed  literature  review,  it  is  concluded  that  the  most  viable  methods 
for  mm*wave  FET  noise  modeling  use  a  physically  based  noise  equivalent 
circuit.  Conventional  noise  parameter  measurements  are  complicated  at  mm- 
wave  frequencies,  by  the  difficulty  of  fabrication  and  characterization  of  a  low 
loss  variable  impedance  tuner.  This  summer’s  research  has  advanced  a  new 
method  that  overcomes  this  difficulty  by  using  a  physically  based  noise 
equivalent  circuit  whose  internal  noise  sources  are  characterized  by  noise 
figure  measurements,  made  over  a  range  of  frequencies  without  the  need  for  a 
tuner.  The  new  method,  therefore,  may  be  used  as  a  stand  alone  mm-wave 
noise  modeling  tool.  Alternatively,  the  method  may  be  used  to  complement  a 
tuner  based  noise  parameter  measurement  procedure  in  the  following  two 
ways.  First,  it  can  be  used  as  a  means  to  examine  the  integrity  of  the  noise  data 
corresponding  to  a  particular  source  impedance  tuner  state.  Second,  it  can  be 


17-17 


used  as  a  means  to  use  the  available  measured  noise  data  from  the  tuner 
procedure  to  derive  a  convenient  frequency  extrapoiatible  model. 


This  method  builds  upon  the  work  of  a  previous  AFOSR  summer  research 
project.  An  investigation  of  this  previous  work  produced  promising 
comparisons  to  measured  data,  but  also  revealed  that  modifications  are 
necessary  for  reliable  mm-wave  model  development.  A  new  computer 
program  that  incorporates  some  of  these  improvements  has  been  developed 
and  verified.  Work  towards  implementing  the  remaining  improvements  has 
been  initiated.  It  is  strongly  encouraged  that  additional  research  be  performed 
to  more  fully  develop  and  verify  the  method  advanced  here,  and  to  broaden  its 
range  of  applicability  to  include  temperature  dependent  FET  noise 
characterization. 
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FOOTNOTES 


1  Equation  (1)  is  also  expressible  in  terms  of  source  impedance  or  in  terms  of  source 
reflection  coefficient,  in  which  case  the  optimum  source  impedance  and  reflection 
coefficient,  respectively,  are  the  parameters  of  interest. 

2  Personal  communication  with  R.  Webster,  Rome  Laboratory,  April  8,  1991. 

2See  also  ”R&D  Record  Notebooks*  of  L  Dunleavy,  and  S.  Lardizabal.  A  separate  report 
is  also  being  prepared  to  detail  the  results  of  the  numerical  experiments. 

^  The  noise  sources,  on  a  physical  basis,  cannot  be  more  than  100%  correlated. 

5  Personal  communication  with  R.  Pucel,  A,  Riddle,  W.  Heinrich  at  1991  IEEE  MTT-S 
conference. 

60n  another  controversial  point,  Riddle  [31]  claims  that  1/f  noise  produced  a 
significant  influence  on  his  microwave  noise  parameter  calculations.  In  subsequent 
conversations  with  L  Dunleavy,  both  R.  Pucel  and  M.  Pospieszalski  find  this  surprising 
and  somewhat  questionable,  as  1/f  noise  is  widely  believed  to  be  entirely  negligible  at 
microwave  frequencies.  We  may  wish  to  re-address  this  question  in  future  research. 
^Resu  ls  and  comparisons  are  discussed  in  the  separate  report  mentioned  above  detailing 
the  numerical  experiments. 
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Figure  1 .  Flow  chart  showing  the  relationship  between  various  reported  noise  characterization 
methods.  All  of  the  methods  rely  on  the  linear  active  two  port  noise  theory  developed  by  Roth 
and  Dahike. 


Figure  2.  Noise  equivalent  circuit  used  in  the  present  research.  Like  similar  circuits  used  by 
others,  this  circuit  adds  equivalent  noise  sources  to  a  small  signal  FET  model. 
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Rgure  5.  Comparison  of  noise  parameters  calculated  for  a  MESFET  using  the  present  modeling  method  to 
measured  noise  parameters  obtained  with  a  tuner  based  procedure. 
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Abstract 

This  research  involved  increasing  the  capabilities  of  the  Sinusoidal  Transform  Coder 
(STC),  a  high  quality,  low  bit  rate  speech  coder.  The  new  methods  developed  in  the  course 
of  this  work  provide  a  means  for  the  compression  of  digital  speech  data  for  applications 
such  as  voice  mail.  Previous  methods  suitable  for  such  applications  allow  only  fixed 
compression  ratios;  that  is,  given  a  specified  amount  of  input  data,  the  stored  file  size  must 
be  a  certain  (smaiier)  fixed  size.  The  speech  quality  is  also  fixed  when  the  traditional 
compression  methods  are  used.  The  new  algorithms  allow  quality  to  be  balanced  against 
memory  requirements,  so  that  substantially  greater  compression  can  be  attained,  albeit  at 
lower  quality.  In  the  new  method,  parameter  space  transformations  are  performed,  so  that 
the  speech  quality  is  high  for  a  given  compression  ratio,  as  compared  to  alternate  methods. 
The  compression  can  be  performed  in  multiple  stages  of  any  selected  size,  extending  the 
previous  parameter  transformation  techniques.  Other  results  of  this  research  are  algorithms 
which  provide  a  method  for  the  reconstruction  of  lost  packets  of  digital  voice  data  for 
various  loss  environments  were  developed.  The  new  algorithms  allow  very  high  packet 
loss  rates  to  be  endured,  although  some  slight  reduction  in  quality  as  compared  to  the 
equivalent  unimpaired  speech  might  be  experienced.  These  techniques  can  be  applied  to 
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packet  or  land  mobile  radio  environments,  or  a  combination  of  both.  They  can  also  be  used 
in  conjunction  with  traditional  error  control  coding  methods  to  provide  an  exceptionally 
high  degree  of  robustness  in  the  presence  of  network  and  channel  errors. 

Introduction 

This  report  describes  research  performed  during  the  AFOSR  Summer  Research  Program 
related  to  low  bit  rate  speech  coding  using  sinusoidal  representations.  This  work  involved 
modifications  and  extensions  to  the  Sinusoidal  Transform  Coder  (STC)  developed  by  R.  J. 
McAulay  at  MIT  Lincoln  Laboratory  under  the  support  of  the  Rome  Laboratories  Digital 
Speech  Laboratory  [6, 8, 9, 10].  The  STC  modifications  and  extensions  provide  significant 
increases  in  flexibility  and  capability,  particularly  in  harsh  communications  environments 
where  congestion  and  errors  are  common. 

This  v'ork  was  primarily  focussed  in  two  separate,  albeit  somewhat  related,  areas.  The 
first  of  these  areas  is  the  multi-stage  transformation  of  encoded  speech  from  one  rate  to 
another  lower  rate.  The  second  area  is  the  reconstruction  of  missing  packets  of  speech  data. 
These  areas  of  research  are  related  by  the  method  of  solution,  that  is,  manipulation  of  the 
parameteric  (as  opposed  to  waveform)  representation  of  tlie  speech  signal. 

Discussiou  of  Research  Problems 

Voice  communications  systems  can  be  evaluated  on  the  basis  of  many  interrelated 
criteria,  including  intelligibility,  perceived  quality,  and  environmental  robustness.  Sev¬ 
eral  of  these  measures  are  clearly  more  significant  than  others,  but  the  fact  remains  that 
speech  coding  algorithms  do  not  exist  in  isolation  from  the  entire  voice  communications 
system.  It  is  therefore  important  that  flexibility  of  application  and  robustness  to  severe  data 
transmission  environments  be  considered  when  evaluating  speech  coding  methods. 
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Speech  Data  Compression  with  Sinusoidal  Representations 

A  variety  of  voice  communications  systems  require  data  rate  compression  of  the  speech 
signal.  Approaches  to  this  problem  can  be  separated  into  two  categories,  algorithms  which 
make  few  assumptions  about  the  nature  of  the  input  data,  such  as  adaptive  Huffman  or 
Lempel-Ziv-Welsh  (LZW)  coding,  and  those  which  are  optimized  for  the  expected  data,  in 
this  case  speech,  such  as  ADPCM  and  LPC-based  coders.  The  first  type  of  compression 
technique  reduces  the  memory  storage  requirements  with  no  information  loss,  while  the 
second  type  involves  some  reduction  in  information  as  compared  to  the  original  data.  Both 
types  of  coders  reduce  the  memory  usage  by  a  fixed  percentage.  The  method  presented 
here  allows  for  memory  storage  to  be  balanced  against  speech  quality. 

The  basic  coding  technology  used  is  the  Sinusoidal  Transform  Coding  (STC)  speech 
compression  method  [6,  7,  8,  9,  10].  The  STC  model  has  been  shown  to  be  effective 
at  various  discrete  rates  from  8  kbps  to  2.4  kbps  [9].  Further,  this  coding  technology  is 
extremely  robust  to  manipulation  in  the  parameter  space,  as  shown  by  the  previous  work 
in  STC  parameter  space  transformations  [1].  The  parameter  space  transformation  method 
allows  the  most  optimum  coding  for  this  coder  type  to  be  used  for  a  given  compression 
ratio.  The  technique  presented  here  allows  compression  in  multiple  stages  of  any  selected 
size,  with  no  interaction  with  the  source. 

Speech  Packet  Data  Loss 

Telecommunications  networks  are  becoming  increasingly  diverse  with  the  introduction 
of  new  services,  such  as  digital  mobile  radio,  and  architeetmes,  such  as  ATM-based  packet 
networks,  to  serve  a  variety  of  needs  as  shown  in  Figure  1.  The  heterogeneous  nature 
of  such  networks  implies  that  the  representation  which  will  be  used  for  speech  signals 
must  be  robust  in  the  presence  of  all  degradations  that  may  be  encountered  in  the  path 
from  source  to  destination.  This  work  indicates  that  parametric  coders  using  sinusoidal 
representations  can  be  used  in  extremely  harsh  packet  loss  environments,  even  beyond 
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Rgurc  1:  Heterogeneous  Packet  and  Mobile  Environment 

those  which  degrade  PCM  speech  to  unacceptable  levels.  Although  this  work  is  primarily 
directed  at  data  loss  recovery  in  packet  networks,  the  techniques  described  here  can  be  used 
for  signal  reconstruction  when  fades  or  bit  errors  which  arc  not  corrected  by  conventional 
error  control  coding  occur  in  a  digital  mobile  radio  environment 

The  reconstruction  of  lost  packets  containing  64  kbps  PCM  and  32  kbps  ADPCM 
speech  data  has  been  studied  in  relation  to  anticipated  congestion  problems  in  the  backbone 
packet-based  nctwoik,  where  excessive  delay  or  buffer  overflow  can  result  in  packet  discard 
[2,  3, 4, 5, 11,12, 13}.  TTiis  prior  work  has  produced  methods  for  leconstnicrion  of  PCM 
packets,  with  acceptable  uniform  loss  rates  up  to  approximately  6%  with  16  ms  packets. 
Reconstruction  of  ADPCM  using  similar  strategies  has  proved  more  difficult,  due  to  the 
adaptive  nature  of  the  coder. 

The  basic  coding  technology  used  here  is  the  Sinusoidal  Transform  Coding  (STQ 
meihod  16,  8,  9,  10].  The  .STC  model  ha.s  been  shown  to  be  effective  at  various  discTcte 
rates  from  8  kbps  to  2.4  kbps  [9].  Testing  of  low  rate  coders  has  indicated  that  long  frame 
lengths  of  30-40  ms  arc  highly  usable,  which  suggests  that  the  STC  parameters  exhibit 
good  lime  stability.  Furtlicr,  this  cociing  technology  is  extremely  robust  to  manipulation  m 


the  paranicter  space,  as  shown  by  work  in  STC  parfimeter  space  transformations  [1].  The 
wo^  presented  here  is  primarily  focused  on  the  4.8  kbps  coder  which  might  be  applicable 
to  digital  mobile  environments. 

The  proposed  Broadband  ISDN  will  be  based  on  the  Asynchronous  Transfer  Mode 
(ATM)  standard,  wliich  defines  the  packet  data  field  size  to  be  48  bytes.  Due  to  delay 
considerations,  however,  it  is  anticipated  that  only  one  frame  (20  ms  for  our  4.8  kbps  STC) 
will  likely  be  included  in  a  single  ATM  packet.  The  results  of  this  research  are  based  on 
these  assumptions. 

Results 

Efficient,  Storage  Adaptive  Speech  Compression 

Because  of  the  robust  nature  of  the  representation  used  by  the  STC  algorithm,  not  only 
c  1  the  coder  operate  at  various  rates  [9],  but  transformations  in  the  parameter  space  can 
be  used  to  change  the  bit  rate  of  the  coded  data. 

The  parameter  transformation  teclinique  allows  the  coding  for  all  spectrum,  excitation, 
and  frame  length  parameters  to  be  selected  at  each  rate  to  give  the  optimum  coding  for  that 
rate.  Among  the  parameters  which  may  be  varied  at  each  rate  are  the  number  of  cepstral 
coefficients,  the  frequency  range  for  coding,  the  method  of  representation  (frame-fill  or 
the  actual  value)  of  the  mid-fraine  excitation  parameters,  and  the  number  of  bits  used  for 
each  parameter.  The  method  for  changing  the  spectral  representation  (that  is,  the  number 
of  cepstral  coefficients)  is  shown  in  Figure  2.  The  cepstral  coefficients  derived  from  the 
coded  parameters  are  used  to  derive  a  spectral  envelope,  whicit  is  then  used  to  generate 
a  new  set  of  cepstral  coefficients,  which  can  tlien  be  coded.  The  parameter  temporal 
interpolation  technique  used  for  frame  length  changes  at  each  stage  is  depicted  in  Figure 
3.  The  various  parameters  at  each  new  frame  boundary  (that  is,  at  TI^-dT^.T^+i),  afs 
linearly  interpolated  according  to  the  distance  of  the  new  frame  boundaries  from  the  closest 
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Figure  2:  Technique  for  Changing  the  Spectral  Representation 
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Figure  3:  Interpolation  of  Parameters  for  New  Frame  Lengths 


old  edges  (Tk_i,T!c,Tk+i,Tk+2).  Provision  is  also  made  for  interpolating  mid-frame  and 
frame-fill  parameters  in  a  similar  manner. 

The  method  presented  above  is  ideal  for  many  applications.  A  typical  application  for 
this  multi-level  compression  technique,  a  voice  mail  system,  was  created  to  test  the  overall 
concept.  The  structure  of  the  prototype  voice  mail  system  is  illustrated  in  Figures  4  and  5. 
The  mail  is  generated  by  a  sender  and  transmitted  to  the  recipient  via  a  computer  network,  in 
this  case  via  Ethernet  using  the  Sun  Microsystems  Remote  ih-ocedurc  Call  (RPC)  protocol, 
which  in  turn  uses  the  standard  TCP/IP  protocol  stack.  The  storage  system  processes  the 
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Recipient 


Figure  4:  Voice  Mail  Application 

arriving  packets  to  insure  that  message  consistency  is  maintained.  The  speech  data  is  then 
passed  to  the  STC  analyzer  to  code  the  data  in  the  STC  format,  if  it  is  not  already  using  that 
representation  (this  initial  encoding  provides  a  4-to-l  compression  over  the  standard  64 
kbps  ^-law  representauon).  After  the  initial  compression,  the  data  is  stored  on  disk.  The 
storage  management  logic  then  checks  the  storage  space  remaining  available,  and  performs 
transformations  as  necessary.  The  memory  management  control  logic  is  shown  in  Figure 
6.  The  simple  scheme  implemented  in  the  prototype  system  reduces  the  effective  bit  rate  of 
all  of  tlie  stored  messages  when  a  specified  threshold  is  reached.  In  this  manner,  the  total 
memory  usage  is  reduced,  with  corresponding  quality  reductions  in  all  of  the  messages. 
More  advanced  control  mechanisms,  such  as  transformation  of  previously  checked  or  old 
messages  before  new  messages,  can  be  applied  in  a  similar  manner. 

In  order  to  verify  that  the  transformation  was  producing  high  quality  speech,  intelligi¬ 
bility  tests  were  performed.  The  2.4  kbps  test  data  was  generated  by  performing  the  initial 
STC  analysis  at  a  16  kbps  rate  with  a  10  ms  frame  size.  The  rate  was  then  reduced  via  the 
transformation  to  8  kbps  with  a  20  ms  frame  length,  then  to  4.8  kbps  with  a  20  ms  frame, 
and  finally  to  2.4  kbps  with  a  30  ms  frame  length.  The  results  are  summarized  in  Table  1. 
Most  listeners  find  the  the  perceived  speech  quality  to  be  good,  only  slightly  less  pleasing 
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Figure  5:  Voice  Mail  Processing  System 


Figure  6:  Message  Compression  Logic 


18-8 


Table  1:  DRT  Test  Results 


Figure  7:  Packet  Loss  Recovcry/Reconstruction  System 


than  the  untransformed  2.4  kbps  STC. 

STC  Packet  Loss  Eeconstruction 

Missing  STC  parameter  packets  can  be  reconstructed  using  the  system  shown  in  Figure 
7.  The  most  straight-forward  method  is  to  use  the  parameters  received  in  the  previous 
packet  (parameter  freezing),  as  depicted  in  Figure  8.  Note  that  this  method  entails  no 
additional  delay  beyond  the  usual  analysis/synthesis  time.  Better  performance  can  be 
attained  by  using  the  packet  received  after  the  missing  packet  and  parameter  interpolation 
procedures.  This  method  is  illustrated  in  Figures  9  and  10.  Linear  interpolation  is  used  for 
the  spectrum  and  excitation  parameters  when  the  adjacent  frames  are  available.  If  more 
than  one  frame  in  a  row  is  missing,  the  closest  received  frame  is  used. 

In  order  to  test  the  effectiveness  of  these  methods,  the  STC  has  been  informally  tested  at 
a  variety  loss  rates,  where  the  missing  packets  are  uniformly  distributed.  The  performance 
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Figure  8:  Parameter  Freezing  for  STC  Packet  Loss  Reconstruction 
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Figure  10:  Parameter  Interpoiation  for  STC  Packet  Loss  Reconstruction 

of  the  STC  in  all  packet  loss  environments  was  outstanding.  At  12%  loss  rates,  which 
were  unacceptable  for  even  the  best  64  kbps  PCM  reconstruction  methods  [13],  the  4.8 
kbps  STC  was  only  slightly  degraded  when  parameter  freezing  reconstrucdon  was  used, 
and  the  loss  was  almost  unnodceable  when  interpolation  was  applied.  In  order  to  verify 
and  quantify  these  performance  results,  DAM  tests  arc  currently  being  performed  on  both 
the  freeze  and  interpolation  methods  with  12%  loss  rates.  The  best  PCM  reconstruction 
method  at  this  loss  rate  [12]  is  also  being  tested  for  comparison. 

Summary  of  Results 

This  research  has  produced  several  new  algorithms  which  provide  significant  increases 
in  speech  coder  utility.  These  results  have  been  described  in  the  following  papers: 

J.  B.  Evans  and  T.  G.  Champion,  “Robust  Speech  Coding  and  Reconstruction 
Techniques  for  Heterogeneous  Mobile/Packet  Networks”,  submitted  to  1992 
IEEE  Int.  Conf.  Acoust.,  Speech,  Signal  Processing. 

T.  G.  Champion  and  J.  B.  Evans,  “A  Multi-Rate  STC  Speech  Compression 
Technique  with  Applications  in  Voice  Mail”,  submitted  to  1992  IEEE  Int. 

Conf.  Acoust.,  Speech,  Signal  Processing. 

Further,  patent  disclosures  on  this  work  have  been  iued  with  the  U.S.  Government: 
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“A  Multi-Rate  STC  Speech  Compression  Technique  for  Speech  Data  Storage”, 
Joseph  B.  Evans  and  Terrence  G.  Champion. 

“STC  Speech  Packet  Reconstruction  Techniques  for  Heterogeneous  Mobile/Packet 
Network  Environments”,  Joseph  B.  Evans  and  Terrence  G.  Champion, 

Conclusion 

The  work  performed  under  this  program  has  resulted  in  new  algorithms  for  improved 
capability  for  speech  data  storage  and  for  high  quality,  low  rate  speech  coding  in  harsh 
communications  network  environments. 

A  new  multi-stage  speech  compression  algorithm  using  parameter  space  transforma¬ 
tions  was  presented.  This  new  method  allows  speech  quality  to  be  balanced  against  storage 
requirements,  while  maintaining  high  quality  for  each  particular  rate.  This  algorithm  al¬ 
lows  more  efficient  use  of  resourscs.  Specifically,  in  a  military  environment,  the  need  for 
voice  data  storage  resources  (more  message)  may  be  considerably  greater  during  times  of 
crisis  than  at  normal  times.  Some  quality  can  now  be  sacrificed  during  critical  periods,  as 
conditions  demand,  while  maintaining  high  quality  when  resource  usage  is  light. 

New  methods  of  packet  loss  recovery  for  low  bit  rate  speech  systems  were  developed. 
These  algorithms  allow  extremely  high  packet  (frame)  loss  rates  to  be  endured,  albeit  at 
some  slight  reduction  in  quality  as  compared  to  the  equivalent  unimpaired  speech.  These 
techniques  can  be  applied  to  both  packet  and  land  mobile  radio  environments.  When 
used  in  conjunction  with  traditional  error  control  coding  methods,  the  new  algorithms  can 
provide  an  exceptionally  high  degree  of  robusmess  in  the  presence  of  network  and  channel 
errors.  The  new  methods  provide  important  new  capabilities  for  the  military  environment 
by  vastly  increasing  tlie  usability  of  STC-based  voice  communications  in  harsh  jamming 
or  transmission  channel  noise  environments. 

Both  of  these  developments  exploit  the  parameter  space  robusmess  of  the  original  STC 
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representation.  Future  research  will  undoubtedly  uncover  additional  applications  arising 
from  this  characteristic  of  the  STC. 
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A  GENERAL  ONE-DIMENSIONAL  III-V  HETEROJUNCTION  DEVICE 


SIMULATOR 

Dr.  Ronnie  E.  Owens.  Assistant  Professor 


1  Introduction 

This  repor.  describes  a  general  one-dimensional  III-V  heterojunction  device  simula¬ 
tion  program  which  was  developed  by  the  author  during  the  1991  ten  week  summer 
appointment  with  Rome  Laboratories  at  Hanscom  Air  Force  Base.  Research  on  het¬ 
erojunction  devices,  as  well  as  the  related  materials  research,  are  important  ongoing 
efforts  which  will  lead  to  high-speed  digital  and  high-frequency  analog  circuitry  far 
outstripping  the  performance  of  silicon.  In  addition,  almost  all  optoelectronic  de¬ 
vices  in  use  today  are  based  on  the  III-V  heterojunction  technologies.  Modeling  is 
an  especially  important  part  of  this  research  and  can  be  crucial  in  device  design  and 
analysis.  The  program  described  herein  represents  an  effort  to  provide  a  tool  which 
will  be  useful  to  the  device  designer  and  analyst.  The  program  is  capable  of  simulat¬ 
ing  heterojunction  devices  composed  of  III-V  ternary  materials  with  compositional 
grading,  impurity  grading,  optical  excitation,  and  a  general  trap  model  intended  for 
use  in  modeling  the  InP/oxide  interface.  Significant  effort  was  expended  in  pro'/iding 
a  convenient  user  interface  to  the  program.  In  the  succeeding  sections  the  following 
topics  are  discussed:  the  physics  background  of  the  simulation  program,  a  description 
of  the  use  and  internal  structure  of  the  program,  some  examples  including  simulations 
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X  position  in  the  device  (perpendicular  to  layers) 

Nc  conduction  band  density  of  states 

Nv  valence  band  density  of  states 

Fi  Fermi-Dirac  integral  of  order  one-half 

V  electrostatic  potential 

X  electron  affinity 

On  (x)  quasi-Fermi  level  for  electrons 
Op  (x)  quasi-Fermi  level  for  holes 
n  electron  density 

p  hole  density 

g  degeneracy  factor 

Nd  donor  density 

Na  acceptor  density 

Edb  donor  binding  energy  (i.e.  relative  to  conduction  band  edge) 

Eab  acceptor  binding  energy  (i.e.  relative  to  valence  band  edge) 

e  permittivity 

Pn<Pp  electron  and  hole  mobility 

r„,  Tp  electron  and  hole  recombination  lifetimes 

J„,  Jp  electron  and  hole  current  density 

Un,  Up  electron  and  hole  recombination  currents 

G  optical  generation  rate 

hu  incident  radiation  energy 

Table  1:  List  of  Symbols. 


of  an  InP/InAlAs  power  MISFET,  and  some  suggestions  for  future  improvements. 


2  Background 

This  section  details  the  physics  background  for  the  simulator.  Table  1  provides  a 
list  of  symbols  for  this  section.  The  basic  theoretical  framework  for  the  equations  is 
based  on  position-dependent  band  structure  transport  model  presented  by  Marshak 
[1).  This  approach  references  the  electrostatic  potential  to  the  vacuum  level.  The 
vacuum  level  merely  provides  a  convenient  continuous  reference  level.  The  electron 
affinity  is  assumed  to  be  a  phenomenological  parameter  and  is  chosen  so  as  to  yield 
the  correct  band  offsets  between  different  materials.  This  model  is  essentially  the 
Anderson  model  which,  although  suffering  from  criticism  in  recent  years,  still  serves 
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as  a  convenient  phenomenological  starting  point.  The  substrate  ( i.e.  the  last  layer)  is 
chosen  as  the  potential  reference.  With  theses  choices,  the  electron  and  hole  densities 
in  non-equilibrium  can  be  written  as 


n  =  iVcF. 
2 


(V{x)  +  xjx)  -  o„(j)\ 

I  kT  )' 


(1) 


and 

\/(x)  -h  Y(.r)  Egjx)  -  <f>p(x) 

kT 

respectively.  Given  impurity  densities  as  a  function  of  position,  ionized  impurity 
distributions  in  non-equilibrium  are  given  by 


Ni  =  Nd{x) 


1  - 


1 


and 


1  4.  i  exp 


:Y4(.r) 


)J 


(3) 


(4) 


^exp  ^  V(x)+\{x)+Eg(i)^E^BM-'i>p{x)^ 

for  donors  and  acceptors  respectively.  Similar  expressions  yield  ionized  trap  densities 
for  donor-like  traps,  Nqj,  and  acceptor-like  traps,  with  quasi-Fermi  levels  for 
electrons  and  holes  replaced  with  quasi-Fermi  levels  for  traps,  (I>dt  and  (j>AT  [2]-  The 
emission  and  capture  rates  of  the  traps  are  calculated  using  the  model  from  [2]  and 
the  trap  quasi-Fermi  level  is  updated  at  each  timestep.  These  terms  are  the  source 
terms  for  the  Poisson  equation  which,  in  one  dimension,  may  be  written 


Note  that  writing  the  equation  in  this  form  automatically  handles  boundary  condi¬ 
tions  at  interfaces  between  materials  with  differing  permittivities. 
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The  electron  and  hole  currents  may  be  written,  in  general,  as 


Jn  =  9/i«(x)n(i)- 


Jp  =  nwMx)- 


In  this  form,  heterojunctions  are  handled  naturally  in  the  drift-diffusion  approxima¬ 
tion,  with  additional  currents  resulting  from  position  dependent  density  of  states  and 
electron  affinity.  This  is  thus  a  more  convenient  form  for  the  cunrenit  density  than 
summing  a  drift  term  and  a  diffusion  term.  The  mobility  is  assumed  to  be  given  by  a 
low-held,  low-doping  limit  multiplied  by  terms  which  account  for  impurity  scattering, 
field  dependence  and  temperature  dependence.  The  parameters  which  describe  these 
effects  are  contained  in  a  database  of  materials  constants  and  are  ac^  «  when  the 
program  initializes.  Values  of  mobilHy  are  also  interpolated  for  ternary  materials 
using  the  empiiiral  expressions  given  by  '^arandet  et  al.  [3]. 

Additional  currents  also  occur  locally  through  recombination  and  generation.  Car¬ 
rier  recombination  is  assumed  to  be  given  by  the  Shockley-Read- Hall  statistics,  viz., 


U  =  - 


pn  —  nf 


n  -f  p  4-  2n,-  cosh  (  ) 

with  hole  recombination  assumed  to  be  given  by  the  same  expression.  The  recom¬ 
bination  lifetimes  r„  and  Tp  are  given  by  empirical  expressions  which  account  for 
vai'- ‘■'■''S  due  to  doping  [4].  Carrier  generation  may  occur  through  optical  excitation 
and  is  given  at  each  point  by  (5,  6] 


G  =  <t>opa{x, 
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in  which  it  is  assumed  that  light  of  a  single  input  frequency  is  incident.  The  absorption 
coefficient  as  a  function  of  position  and  energy  is  given  by  [7] 


cc{x.hu) 


(10) 


for  frequencies  of  excitation  above  the  direct  bandgap  frequency,  below  which  the 
absorption  coefficient  is  assumed  to  be  zero.  In  [7]  Bennett  and  Soref  report  values 
of  C  for  various  binary  III-V  semiconductors.  Values  for  the  ternary  semiconductors 
used  in  this  work  were  interpolated  from  these. 

The  electron  and  hole  densities  develop  with  time  according  to  the  current  conti¬ 
nuity  equations  given,  in  one  dimension,  by 


dn 

dt 


1^ 

q  dx 


■f  G„  —  f/n 


(11) 


for  electrons  and 


for  holes. 


dt 


(12) 


The  numerical  solution  of  this  system  of  equations  yields  the  potential  and  quasi- 
Fermi  levels  for  electrons  and  holes  at  all  points  in  the  device.  With  these  many 
parameters  of  interest  may  be  calculated.  Terminal  currents  may  be  derived,  two- 
dimensional  charge  densities  at  interfaces  may  be  calculated,  and  the  differential 
capacitance  can  be  deduced  by  calculating  the  difference  in  the  net  charge  between 


two  runs. 
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3  Program  Description  and  Use 


The  program,  which  is  called  hj  Id.  has  more  or  less  been  implemented  in  standard 
FORTR.A.N^  77.  Some  aspects  of  the  program  are  Unix-specific  and  the  best  environ¬ 
ment  is  a  Sun/Sparc  Unix  workstation.  Minor  modifications  may  be  made  to  the  code 
that  will  enable  it  to  run  on  other  platforms..  The  program  executable  file  should  be 
saved  in  some  location  that  is  accessible  by  all  those  who  wish  to  execute  it  and  the 
full  directory  name  of  this  location  should  be  added  to  the  shell  variable  path. 

The  equations  of  the  previous  section  are  all  solved  using  a  finite  difference  method 
with  a  non-uniform  grid.  The  grid  is  automatically  generated  and  care  is  taken  to 
assure  that  the  mesh  spacing  changes  no  more  than  a  factor  of  2  between  successive 
points  to  preserve  the  numerical  accuracy  of  the  approximation.  A  maximum  of  three 
hundred  mesh  points  are  allowed  in  the  simulation.  An  error  message  is  generated 
if  this  limit  is  exceeded.  The  Poisson  equation  is  solved  using  the  simple  algorithm 
presented  by  Mayergoyz  [8].  This  method  basically  discretizes  Equation  5  according 
to  the  finite  differences  prescription  and  solves  the  resulting  system  of  non-linear 
equations  using  a  relaxation  technique.  In  [S]  it  was  demonstrated  that  this  method 
is  unconditionally  convergent.  Convergence  is  generally  indicated  if  the  maximum 
change  in  voltage  is  less  than  10~®  kT  in  a  given  iteration. 

The  solution  of  the  current  continuity  equation  is  somewhat  more  troublesome. 
The  discrete  form  of  the  divergence  results  in  basically  a  sum  of  currents  into  and  out 
of  meshpoints.  This  essentially  means  evaluating  the  currents  between  meshpoints 
and  this  re<iuires  an  estimate  of  the  carrier  concentrations  between  meshpoints.  Us- 
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ing  a  simple  linear  average  of  carrier  concentrations  yields  an  unstable  system  of 
equations.  Scharfetter  and  Gummel  [9]  solved  this  problem  by  expanding  the  carrier 
concentration  between  meshpoints  so  as  to  arrive  at  a  co-  stant  current.  This  method 
provides  a  convergent  algorithm  in  most  cases.  Again  convergence  is  indicated  by  a 
maximum  change  in  quasi-Fermi  level  for  electrons  and  holes  of  less  than  10“®  kT. 

The  overall  solution  for  steady  state  is  arrived  at  by  successively  solving  the  Pois¬ 
son  equation  and  current  continuity  equation  in  sequence.  During  the  Poisson  solu¬ 
tion,  the  quasi-Fermi  levels  are  assumed  to  be  held  fixed,  while  during  the  current 
continuity  solution  the  potential  is  assumed  to  be  held  fixed.  This  proceeds  back  and 
forth  until  self-consistency  is  achieved.  For  a  transient  solution,  data  from  a  previ¬ 
ous  run  is  used  with  the  time  derivatives  in  Equations  (11)  and  (12)  approximated 
by  a  backward  difference  approach.  The  simulation  solves  the  Poisson  and  current 
continuity  equations  in  sequence  until  the  discrete  form  of  (11)  and  (12)  is  satisfied. 
The  solution  of  the  current  continuity  eq\iation  is  guaranteed  to  converge  with  this 
approach.  It  has  not  yet  been  proven  whether  the  Poisson-current  continuity  system 
is  convergent  using  this  approach.  The  program  will  exit  and  print  a  summary  of 
results  if  both  the  Poisson  and  current-continuity  solvers  have  converged. 

This  program  incorporates  the  ability  t  model  a  single  trap  level  within  the  band 
gap.  The  capture  cross  section  and  areal  density  of  interface  traps  may  be  entered  at 
any  point  in  the  device.  The  two-dimensional  density  of  traps  is  assumed  to  be  spread 
out  over  a  user-specified  layer  thickness  which  may  be  as  thin  as  1  A.  Thus  these  traps 
are  ostensibly  treated  as  a  three-dimensional  density  internal  to  the  program.  The 
emission  and  capture  rates  of  electrons  and  holes  to  both  bands  are  calculated  and 
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Ohmic  Contact  Device  Structure 

Temperature 

300.0 

Device  Area  (m''2) 
lOOe-12 

Nsrf  Esrfb  Phib 
contl  type^ohmic 
cont2  type^ohmic 
Layer  Descriptions 

Mat.  X  Grade  Th(A)  Nd  Grade  Edb  Na  Grade  Eab  Max  Mesh 
algaas  x=0.30  th=‘360  nd=1.0el7  edb=0.006  maxmsh=40 
algaas  x*0.30  th**60  nd>"1.0el7  edb-0.006  maxmsh*>10 
algaas  x“0.0  th=5000  nd“1.0el7  edb=0.006  maxmsh=1000 


Figure  1:  Device  file  for  an  AlGaAs/GaAs  Heterojunction 

the  quasi-Fermi  levels  for  the  traps  are  updated  at  each  timestep.  Thus  the  transient 
occupation  of  trap  states  at  an  interface  may  be  accounted  for  and  the  concomitant 
effects  on  the  characteristics  of  an  InP  MIS  capacitor  will  be  presented  in  the  next 
section. 

The  user  input  to  the  program  is  an  ASCII  text  device  file  which  provides  a  de¬ 
scription  of  the  device  to  be  simulated.  The  general  format  of  the  file  is  shown  in 
Figure  1.  The  first  two  lines  are  comments,  followed  by  a  line  with  the  device  sim¬ 
ulation  temperature.  Next  is  another  comment  line  and  the  device  area.  Another 
comment  line  follows  and  the  next  two  lines  are  the  top  and  bottom  contact  descrip¬ 
tions.  If  type=ohmic  appears  on  the  line,  the  program  assumes  no  built-in  barrier 
exists  and  that  equilibrium  exists  at  the  contact  meshpoint.  Voltages  are  applied 
by  changing  the  quasi-Fermi  level  and  conduction  band  edge  and  leaving  these  as 
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X 

Mole  fraction  (ternaries  only) 

0 

xgrda 

Parabolic  material  grade  factor  (4-) 

0 

xgrdb 

Linear  material  grade  factor  (-?,) 

0 

th 

Layer  Thickness  (k) 

1000 

nd 

Donor  density  (cm~3) 

0 

ndgrda 

Parabolic  donor  grade  factor  (-}-) 

0 

ndgrdb 

Linear  donor  grade  factor  (i) 

.4 

0 

edb 

Donor  binding  energy  (eV) 

0.006 

na 

Acceptor  density  (cm“3) 

0 

nagrda 

Parabolic  acceptor  grade  factor  (-1-) 

Linear  acceptor  grade  factor  (4) 

0 

nagrdb 

0 

eab 

.Acceptor  binding  energy  (eV) 

0.02S 

maxmsh 

Maximum  mesh  spacing  for  layer  (A) 

1000 

ndt 

Donor-like  trap  density  (cm~^) 

0 

edt 

Donor-like  trap  binding  energy  (eV) 

0 

sign 

nat 

Donor-like  trap  capture  cross  section  (cm"^) 
Acceptor-like  trap  density  (cm"^) 

0 

oat 

•Acceptor-like  trap  binding  energy  (eV) 

0 

sigp 

munmod 

Acceptor-like  trap  capture  cross  section  (cm~^) 
Electron  mobility  model  (see  text) 

0 

mupmod 

Hole  mobility  model  (see  text) 

0 

Table  2:  Table  of  layer  description  paramters  and  default  values. 

boundary  conditions  for  the  current  continuity  and  Poisson  solvers  respectively.  If 
type»schottky  appears,  then  the  phib®xx  is  scanned  and  the  quasi-Fermi  level  is 
assumed  to  be  pinned  at  the  value  given  (eV)  below  the  conduction  band  edge.  Next 
follows  two  comment  lines  and  the  layer  descriptions.  Each  layer  description  line  be¬ 
gins  with  the  name  of  the  material  and  is  followed  by  a  list  of  keywords  with  an  equal 
sign  and  a  value  (no  spaces  are  allowed  except  between  parameters).  The  material  is 
the  only  position  dependent  parameter  in  the  layer  description.  Table  2  describes  the 
parameters  which  appear  on  the  line  of  the  layer  description  and  the  default  value 
of  each  parameter.  The  materials  which  are  presently  included  in  the  database  are 
Si02,  InP,  AUGai_xAs,  InjGai_:,As,  and  In^Ah-^As.  The  device  file  should  be  in 
the  current  directory  and  it  is  generally  reccomended  that  a  separate  sub-directory 
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be  created  for  each  unique  device  before  running  the  program.  If  any  changes  are 
made  to  the  device  file  which  result  in  a  different  mesh,  {e.g.  additional  layers  or 
thickness  changes,  maximum  mesh  spacing  changes,  etc.)  then  a  new  run  should  be 
made.  This  is  defined  here  as  a  “cold”  start. 

All  the  materials  files  and  files  which  provide  tabulated  values  of  the  Fermi-Dirac 
integrals  which  appear  in  the  equations  should  be  saved  in  a  directory  which  is  pointed 
at  by  the  environment  variable  DATAPATH,  This  environment  variable  should  be  set 
(using  the  Unix  command  setenv)  in  the  user’s  .cshrc  file. 

Assuming  at  this  point  that  the  user  has  generated  a  device  file  in  a  sub-directory, 
the  program  may  be  invoked  by  “hjld  device-file  args".  The  device  file  name  will  be 
assumed  to  end  in  .dev  if  an  extension  is  left  off.  Table  3  presents  a  list  of  command 
line  arguments  that  may  be  used  with  the  program.  Note  that  the  equal  sign  and  a 
number  are  required  for  all  command-U.’^e  arguments  except  the  device  file  name  and 
cold. 

A  typical  run  will  now  be  described,  Assume  that  the  device  file  of  Fij;ure  1 
has  been  entered  in  a  file  which  has  been  named  contact. dev.  The  proceduK^  for 
generating  the  steady-state  I-V  characteristics  from  0  to  1.0  volts  of  applied  bias 
will  now  be  described.  First,  a  cold  run  must  be  made  to  arrive  at  the  equilibrium 
solution  of  the  Poisson  equation.  This  is  accomplished  by  issuing  the  command  “hj  Id 
contact  cold”.  This  generates  a  number  of  raw  and  processed  data  files.  The  raw 
flJes  may  essentially  be  ignored,  while  the  processed  files  may  be  used  to  generate 
graphical  output  of  the  simulation  results.  The  processed  data  files  are  band.dat, 
pn.dat,  recomb.dat,  ionized.dat,  jn.dat,  and  jp.dat.  These  are  respectively. 
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cold 

Indication  of  an  equilibrium  (0  bias)  run 

no 

imain= 

Number  of  main  iterations  to  perform 

1000 

ipoi= 

Number  of  Poisson  iterations  per  main  loop 

1 

icur= 

Number  of  Current  iterations  per  main  loop 

1 

popt= 

Optical  power  incident  (W) 

0 

eopt= 

Energy  of  incident  photons  (eV) 

0 

delt= 

Timestep;  simulated  time  increment  beyond  saved  run  (s) 

lel2 

vg* 

Voltage  to  apply  to  surface  contact  (V) 

Previous  Voltagi 

ganuna= 

Over-Relaxation  parameter  for  iterative  solution 

1 

Table  3;  Table  of  l>e  and  default  values. 

band  diagram,  electron  and  hole  concentration,  recombination  rate,  ionized  impurity 
density,  electron  current  density,  and  hole  current  density  vs.  position  in  the  device. 
These  files  contain  x  —  y  coordinate  pairs  describing  the  aforementioned  data.  Data 
will  also  appear  on  the  screen  describing  the  progress  of  the  simulation.  A  summary  of 
the  device  will  be  printed  out  and  an  iteration  by  iteration  account  of  the  simulation 
is  printed.  A  summary  of  results  is  printed  after  the  simulation  converges.  The 
summary  lists  terminal  currents,  capacitances,  and  areal  electron  and  hole  densities. 
This  summary  may  be  saved  to  a  file  by  redirecting  the  output  of  the  program  to 
a  file  {e,g.  hjld  contact  cold  >out  will  redirect  the  output  to  a  file  called  out). 
Next,  a  “saveit”  command  should  be  issued  (this  is  a  shell  script  which  should  be 
installed  in  the  same  directory  as  hjld).  This  command  saves  the  most  recently 
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generated  solution  for  future  use.  Next,  the  command  “hjld  contact  vg=0.02”  is 
issued  which  will  simulate  the  device  with  an  applied  bias  of  .02  \'.  .Again  the  output 
of  the  simulation  may  be  saved  to  a  file  for  later  use. 

Following  the  convergence  of  this  run,  a  projection  technique  may  be  used  to 
generate  the  initial  guess  for  the  next  run.  This  is  calculated  from  the  present  run 
and  the  saved  run.  Therefore  the  projected  run  will  be  at  .04  V.  The  command  .  ^ed 
is  project.  Next,  another  saveit  is  issued  to  save  the  present  run  and  then  the 
command  mvit  is  issued  to  move  the  projected  run  into  place  in  preparation  for  the 
next  run.  Finally,  “hj  Id  contact”  runs  the  program  for  on  the  recently  projected 
data. 

The  procedure  from  here  on  up  to  1  V  of  bias  is  the  following: 

'/.  project 
*/,  saveit 
y,  juvit 

y,  hjld  contact 

Note  that  the  voltage  step  per  run  is  given  by  the  choice  of  bias  for  the  second  run 
(the  one  following  the  cold  start).  It  is  possible  that  if  the  voltage  step  is  too  large 
that  this  run,  or  perhaps  a  subsequent  run,  will  diverge.  This  is  generally  indicated 
by  currents  that  are  nonsensically  large  or  by  current  continuity  which  is  not  satisfied. 
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AlGaAs/GaAs  Hetero junction  Band  Diagram 


Figure  2:  Band  Diagram  of  the  AlGaAs/GaAs  Heterojunction. 


4  Example  Results 

The  results  of  several  example  devices  are  presented  in  this  section..  First  the  example 
run  of  the  previous  section  is  examined.  This  device  is  basically  a  heteroj unction 
ohmic  contact  and  would  be  commonly  be  found  in  a  MODFET  structure.  Both  the 
surface  and  substrate  contacts  are  assumed  to  be  ohmic.  Since  the  doping  level  is 
only  10'^  cm^  The  heteroj  unction  itself  dominates  the  I-V  characteristics.  Figure  2 
shows  the  equilibrium  band  diagram  of  the  structure.  This  result  would  be  stored  in 
the  band .  dat  file  after  the  cold  start  run.  Figure  3  shows  the  I-V  characteristics  of 
the  structure.  Note  the  non-linearity  in  the  characteristics  and  the  assymetry. 

To  demonstrate  the  programs  opto-electronic  device  capability,  the  simulation  of 
an  InP/InGaAs  heterojunction  phototransistor  are  presented  next.  Figure  4  shows 
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Current  Density  <A  /  100  sq.  microns) 


Figure  3:  Current- Voltage  characi«  ristic  of  an  AlGaAs/GaAs  Heterojunction. 
InG»As/InP  8«nd  rJ'igriB;  Vce»1.0.  Pl»250uW 


Distance  fpoiti  Surface  Inicronsl 

Figure  4:  Band  diagram  of  an  InP/InGaAs  heteroj unction  phototransistor. 


0.0  0.5  1.0 

Applied  Voltage  (Vce) 


Figure  5:  Common  emitter  I-V  characteristics  of  InP/InGaAs  heterojunction  photo¬ 
transistor. 

the  band  diagram  of  an  InP/InGaAs  HPT  under  a  bias  of  1  V  and  an  incident 
optical  power  of  250^ W.  The  device  area  is  7  x  10~*  m“^.  The  common-emitter 
characteristics  of  this  device  for  different  values  of  optical  input  power  are  given  in 
Figure  5.  The  incident  photons  have  an  energy  of  1  eV  and  at  this  energy  pass 
through  the  transparent  InP  emitter  layer  and  are  absorbed  in  the  base  and  collector 
layers.  This  device  has  a  base  width  of  O.l/im.  Figure  6  demonstrates  that  the  gain 
of  the  device  is  a  weak  function  of  the  optical  input  intensity. 

Finally,  and  the  actual  main  thrust  of  the  project,  the  results  of  simulations  of 
an  Si02/InP/InAlAs  MIS  capacitor  are  presented.  FETs  made  from  this  structure 
show  great  promise  in  high  frequency  applications.  This  is  basically  a  depletion  mode 
device  and  the  (undoped)  InAlAs  layer  provides  a  barrier  which  must  be  depleted 
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Figure  6:  Gain  versus  input  optical  power  of  InP/InGaAs  hetero junction  phototran¬ 
sistor. 

through  to  shut  the  device  off.  The  band  diagram  is  shown  in  Figure  7  The  first 
simulations  of  this  device  were  of  the  ideal  structure,  i.e.  with  no  interface  states. 
The  capacitance  versus  voltage  is  shown  in  Figure  8  for  three  different  cases.  In 
the  deep  depletion  case,  a  voltage  ramp  is  applied  to  the  gate  of  the  device  which 
is  sufficiently  fast  that  a  hole  inversion  layer  cannot  form.  The  high  frequency  case 
interperses  steady  state  runs  to  adjust  to  a  new  bias  with  a  1/is  0.1  V  ramp  applied 
thereafter  to  simulate  the  high  frequency  capacitance  measurement. 

To  demonstrate  the  capability  of  simulating  interface  traps  in  this  device  a  series 
of  runs  was  undertaken  with  a  mid-gap  donor-like  trap  at  the  Si02/InP  interface. 
The  density  of  these  traps  is  10^^  cm"^.  This  density  of  traps  will  cause  a  distortion 
of  the  CV  curve  at  a  gate  bias  at  which  the  interface  quasi-Fermi  level  passes  through 
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Figure  9:  InP/In.AlAs  MIS  capacitor  CV  curves  with  10*^  cm“^  donor-like  traps  at 
mid-gap.  Runs  with  different  capacitance  signal  ramp  rates  are  also  shown,  demon¬ 
strating  the  transient  capability. 

tiie  trap  level.  This  distortion  is  shown  in  Figure  9  in  which  it  may  be  noted  that 
the  capacitance  rises  to  the  oxide  capacitance  as  the  traps  begin  to  empty.  Different 
measurement  times  for  the  capacitance  sensing  signal  are  given  in  the  figure  showing 
the  transient  trap  occupatio**  for  finite  measurement  times.  This  indicates  that  under 
certain  me«isurcment  conditions  this  juncture  will  show  strong  hysteresis  effects  as 
the  capacitance  is  measured. 

5  Future  Work 

The  simulation  program  described  in  this  report  is  by  no  means  complete.  The  next 
obvious  step  is  to  implement  a  full  blown  interface  trap  model  which  will  allow  a 
distribution  of  traps  and  capture  cross  ections  throughout  the  forbids' m  gap.  This 
will  allow  modeling  of  the  capacitor  to  be  compared  to  the  actual  devices.  This 
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addition  to  the  model  should  be  a  straight-forward  extension  of  the  single  trap  levH 
that  is  currently  implemented.  This  model  would  be  similar  to  the  one  presented  by 
Hasegawa  et  al.  [10,  11]. 

The  program  also  contains  the  skeletal  beginnings  of  a  three-valley  drift-diffusion 
model.  Fleshing  this  out  would  allow  the  modeling  of  devices  in  which  the  material 
grade  gives  rise  to  crossover  into  indirect  bandgap  materials.  Such  devices  include 
laser  structures  with  graded  index  of  refractions  to  permit  coninement  of  radiation. 

High  field  effects  are  modeled  presently  through  the  use  of  three  mobility  models. 
With  the  three  valley  simulation,  it  may  be  possible  to  model  these  non-stationary 
effects  in  a  more  physical  manner.  Work  is  underway  to  investigate  this  possibility. 

In  addition  to  this  the  addition  of  quantum  effects  is  also  under  investigation. 
Such  effects  would  include  carrier  quantization  at  interfaces  and  tunneling  transport. 

Finally,  the  model  will  be  extended  to  two  dimensions.  This  is  perhaps  the  most 
important  step  in  modeling  the  power  MISFET.  This  work  would  provide  some  insight 
into  some  of  the  anomalous  behavior  the  devices  have  exhibited. 
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FDTD  ANALYSIS  OF  THE  RADIATION  PROPERTIES 
OF  A  PARABOLIC  CYLINDER 
ILLUMINATED  BY  A  VERY  SHORT  PULSE 
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ABSTRACT 

This  report  addresses  the  problem  of  extremely  short-time  pulse  excitation  of 
a  parabolic  reflector  antenna.  The  pulse  is  selected  to  be  short  compared  to  the 
time  required  for  it  to  traverse  the  reflector,  so  that  only  a  fraction  of  the  reflector 
is  illuminated  at  any  given  time.  Because  the  pulse  is  so  short,  standzird  frequency 
domain  methods  of  analysis  are  impractical.  Instead,  the  Finite  Difference  Time 
Domain  method  is  used  to  analyze  the  wave  propagation  and  reflection.  The  field 
distributions  have  been  simulated  for  a  deep  parabola  in  both  transmit  and  receive 
modes.  The  results  indicate  that  commonly  perceived  as.'-amptions  of  differentizd 
transient  reflection  are  inaccurate. 


INTRODUCTION 

As  signal  generation  technology  improves,  the  need  to  understand  the  behavior 
of  shorter  and  shorter  pulses  becomes  increasingly  important.  When  the  duration 
of  the  pulse  becomes  comparable  to — or  shorter  than — the  typical  length  scale  of 
an  antenna  system,  new  ways  of  studying  electromagnetic  field  propagation  become 
necessary. 

One  important  problem  that  has  recently  been  studied  [1,2]  is  the  excitation  of 
a  parabolic  refiector  antenna  by  a  pulse  that  is  shorter  than  its  focal  length.  The 
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traditional  analysis  method  of  tracing  rays  is  brought  into  question  since  the  rays 
do  not  all  illuminate  the  reflector  simultaneously. 

In  fact,  Hansen  first  states  [1]  that  the  leading  and  trailing  edges  of  a  short 
pulse  have  different  antenna  patterns,  lezuiing  to  a  differential  transient  radiation 
characteristic.  The  leading  portion  of  the  transmitted  wave  is  first  reflected  by  the 
reflector  surface  in  a  circle  nearest  the  vertex  while  the  reflector  rim  is  imilluminated, 
while  subsequently,  the  trailing  pulse  edge  has  left  the  vertex  it  still  illuminates  an 
annular  region  near  the  rim.  See  Figure  1.  The  conclusion  is  that  the  initial 
radiation  pattern  is  that  of  a  smaller  diameter  apertture,  while  the  final  pattern  is 
that  of  an  annulus  with  constant  outer  diameter. 

This  view  was  later  reversed  [2]  with  an  argument  that  rays  diffracted  from  the 
edges  of  the  reflector  leave  the  antenna  first,  before  refiected  rays  from  the  vertex. 
The  transient  pattern  should  instead  be  that  of  an  annular  aperture  first,  followed 
by  the  gradually  shrinking  circle. 

The  object  of  the  cxirrent  study  is  to  show  that  neither  conclusion  is  accurate, 
but  instead  that  the  parabolic  reflector  has  no  characteristic  transient  behavior 
other  than  a  simple  aperture.  Two  approaches  will  be  used:  first,  a  geometric 
argument  based  on  a  limiting  case,  and  second,  a  numerical  study  using  the  Finite 
Difference  Time  Domain  (FDTD)  method.  The  FDTD  method  is  not  based  on 
extending  frequency  domain  analysis  across  a  broad  frequency  spectrum,  but  instead 
formulates  the  wave  propagation  completely  in  the  time  domain.  The  shorter  the 
time  pulse,  the  easier  it  is  to  model  the  field  interaction. 

Although  it  must  be  cautioned  that  using  numerical  methods  to  prove  theo¬ 
retical  assertions  is  dangerous,  it  is  the  transient  behavior  that  is  of  concern,  and 
only  a  procedure  which  emphasizes  the  different  tempered  aspects  of  the  field  can 
be  effectively  employed. 
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GEOMETRIC  ARGUMENT  FOR  UNIFORM 
TEMPORAL  BEHAVIOR 

It  is  well  known  that  the  paraboloid  of  rotation  transforms  microwave,  single 
frequency  spherical  waves  centered  at  its  focus  into  plane  waves  leaving  the  reflector. 
Geometric  optics  is  used  as  a  Wgh  frequency  approximation,  treating  waves  as 
surfaces  perpendictdar.  to  idealized  rays.  Equal  path  length  of  every  ray  traced 
from  the  focus  to  the  reflector  and  then  to  an  aperture  plane  perpendicular  to  the 
axis  of  rotation  ensure  that  a  planar  phase  front  will  be  formed.  Thus  even  though 
different  parts  of  the  reflector  are  illiuninated  at  different  times,  eiU  rays  arrive  at 
an  aperture  plane  simultaneously. 

The  geometric  optics  approximation  breaks  down  for  lower  frequencies — more 
precise  analyses,  based  directly  on  the  wave  equation  must  be  used.  Consider  for 
computational  simplicity  a  two  dimensional-  parabolic  reflector  illuminated  by  a 
longitudinally  polarized,  uniform  cylindrical  wave  with  center  at  the  focal  point. 
This  incident  wave  is  given  by: 

=  (1) 


(2^ 

where  Hq  '  is  the  second  order  Hankel  function  and  k  is  the  wave  number.  The 
desired  reflected  and  transmitted  plane  wave  propagates  in  the  y-direction,  and  is 
represented  by: 

=  (2) 

with  some  yet  to  be  specified  tunplitude  dependence  f{x). 

The  equation  for  the  parabola  with  focal  length  /  is: 


(3) 
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This  geometry  is  shown  in  Figure  2.  Dotted  plane  wavefronts  are  continued  past 
the  reflector  to  indicate  the  virtual  planar  source  corresponding  to  the  focal  point 
If  the  parabola  is  a  perfect  conductor,  the  botmdary  condition  that  tangential 
electric  field  be  zero  requires: 

E  =  Ei  +  Er  (4) 


on  the  parabola.  In  fact,  the  condition  is  never  exactly  satisfied  with  the  desired 
choice  of  transmitted  wave,  but  the  actual  error  is  quite  small  for  most  frequencies 
of  interest. 

To  quantify  the  error,  first  asjrmptotically  approximate  the  Hankel  function 
with: 


Bf\ki>)  = 


y/kf)irf2 


e-i*(»+2/)-i»r/4 


(5) 


y/k{y  +  2/)ir/2 


where  the  second  equation  specifies  the  cylindrical  field  on  the  parabola.  Since  the 
approximate  phase  dependence  of  the  incident  field  of  Equation  (5)  is  the  same  as 
that  of  the  reflected  field,  Equation  (2),  the  boundary  condition  can  be  satisfied,  if 
the  amplitude  of  the  reflected  wave  is  set  to  f{x)  =  /  y/kir{x^ /Af  +  f)/2 

on  the  parabola.  It  remains  to  measure  how  good  the  asymptotic  approximation  is. 

Figures  3a  and  3b  show  the  difference  in  phase,  and  the  ratio  of  the  magnitude 
difference  to  magnitude  between  the  Hankel  function  and  its  approximation  as  a 
function  of  electrical  radius.  Note  that  at  as  little  as  one  wavelength  {kp  =  6.28) 
the  phase  error  is  about  .02  radians  (roughly  1.5°),  and  the  magnitude  difference 
is  one  part  in  one  thousand.  The  asymptotic  approximation  is  very  good.  For  any 
practical  reflector  antennas,  the  focal  length  would  be  at  least  tens  of  wavelengths. 
The  error  would  indeed  be  negligible. 

For  all  frequenc’es  of  interest,  the  parabola  reflects  waves  as  expected.  For 
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transient  illumination,  especially  very  short  pulse  excitation  with  predominzintly 
high  frequency  components,  each  of  the  separate  frequencies  in  the  pulse  spectrum 
is  reflected  in  the  same  way.  Each  frequency  component  is  imaged  by  the  reflector 
into  a  plane  wave,  originating  at  the  image  plane  indicated  on  Figure  2. 

Each  frequency  has  a  ^separate  propagation  constant,  so  it  arrives  at  a  arbi¬ 
trary  aperture  plane  with  different  phase  and  amplitude,  but  none  of  the  individual 
components  is  reflected  differently.  The  transient  nattire  of  the  pulse  with  many 
frequency  components  shows  up  as  a  possible  change  in  pulse  shape  at  the  aperture, 
but  in  no  (high  frequency)  case  is  there  any  transient  variation  across  the  aperture. 
Of  course,  once  the  wave  leaves  the  reflector,  diffraction  at  the  reflector  rim  will 
affect  each  frequency  component  differently,  but  no  differently  than  an  aperture 
in  a  conducting  plane  affects  a  normally  incident  plane  wave.  Thus  there  is  no 
differential  transient  effect  on  the  transmitted  wave  due  io  the  parabolic  reflector. 


THE  FDTD  METHOD  OF  WAVE  ANALYSIS 
To  confirm  the  geometrical  argument  presented  above,  the  transient  wave  be¬ 
havior  is  now  analyzed  directly,  using  a  time  domain  numerical  method:  Finite 
Difference  Time  Domain  (FDTD).  This  method  was  first  proposed  on  a  cubical  lat¬ 
tice  by  Yee  [3],  and  then,  more  recently,  popularized  by  others  [4-6].  Other  Finite 
Difference  techniques,  which  make  use  of  triangular,  conformal  meshes  in  both  the 
time  and  frequency  domains  have  been  presented  [7,8]. 

The  idea  behind  FDTD  is  that  Maxwell’s  partial  differential  equations  are  re¬ 
placed  by  multi-dimensionsd  centered  difference  equations  in  space  and  time.  First, 
space  is  discretized  into  two  sets  of  interlocking  cubical  meshes,  with  the  cube  cor¬ 
ners  of  one  mesh  '•oinciding  with  the  cube  centers  of  the  other.  The  magnetic  flux 
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of  Faraday’s  Law  is  solved  at  the  center  of  every  cube  face  in  terms  of  the  electric 
field  on  the  edges  that  bound  that  face;  and  the  electric  flux  of  Ampere’s  Law  is 
solved  at  the  center  of  every  cube  face  of  the  complimentary  mesh  in  terms  of  the 
magnetic  fields  on  the  edges  that  bound  it.  With  isotropic  media  the  electric  flux 
and  the  electric  field  are  related  by  a  scalar  permittivity  constant,  e,  so  that  know¬ 
ing  the  first  from  Ampere’s  Law  allows  the  computation  of  the  magnetic  flux  from 
Faraday’s  Law.  The  field  values  on  one  mesh  are  computed  at  one-heilf  a  time  step 
eihead  of  those  of  the  other,  so  that  the  difference  equation  in  time  at  one  centered 
sp.T.ce  point  can  be  equated  to  the  difference  equation  in  space  at  the  one  centered 
time  point  of  the  complimentary  mesh. 

One  important  detail,  which  must  be  carefully  addressed  for  exterior,  un¬ 
bounded  geometries,  is  simulating  the  wave’s  radiation  to  infinity.  Although  the 
computational  domain  terminates  at  the  end  of  the  array  lattice,  the  wave  must 
act  as  if  it  were  continuing  outward.  In  other  words,  the  wave  must  not  reflect  off 
the  artificial  boundary.  The  boundary  must  absorb  all  incident  waves;  hence  it  is 
referred  to  as  an  Absorbing  Boimdary  Condition  (ABC).  In  principle,  no  numerical 
condition  can  absorb  waves  from  every  incident  angle,  but  there  have  been  several 
ABC’s  proposed  which  absorb  those  near  normal  incidence  [10-13]  which  makes  use 
of  pseudo-differential  aimihilation  operators.  A  new  ABC  method  which  is  based 
on  simulated  anechoic  chamber  absorber  hais  recently  been  presented  [14].  This 
method  appears  to  be  preferable  for  very  large  scatterers  eind  higher  frequencies. 


SIMULATION  RESULTS  AND  ANALYSIS 
For  the  parabolic  reflection  problem,  a  two-dimensional  FDTD  method  was 
implemented  a  vector-processor  enhanced  Digital  Equipment  Corporation  VAX 
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9000  computer.  The  ABC  used  is  the  Engquist'Majda  [11]  planar  condition  on 
forward,  backward,  right  and  left  edges  of  the  computationeil  domain.  The  parabolic 
segment  is  chosen  to  be  four  times  as  wide  as  deep  {F/D  =  0.25).  This  is  typical 
of  deep  reflector  antennas,  with  focus  in  the  plane  of  the  reflector  rim. 

The  Courant  Conation,  that  the  time  step  times  multiplied  by  phase  velocity  of 
the  wave  must  be  less  than  the  spatial  step  is  satisfied  by  selecting:  r  =  cAt/Ax  — 
0.5. 

Three  wave  reflection  cases  are  considered:  1)  a  received  plane  wave  with  Gaus¬ 
sian  pulse  shape;  2)  a  transmitted  cylindrical  wave  with  Gaussian  amplitude  shape; 
and  3)  a  received  plane  wave  with  a  Gaussian  pulse  shape  modulated  with  a  high 
frequency  carrier  signal. 

The  Gaussian  pulse  with  peak  at  x  =  xq  when  t  =  0  and  1/e  level  width  of  2w 
is  represented  in  space  and  time  as: 


E  =: 


(6) 


for  electric  field,  and  for  magnetic  field: 


/v 


(7) 


For  the  first  test  case,  the  parabolic  focal  length  is  selected  to  be  100  mesh  units, 
u>  =  20  and  xq  =  40  and  the  parabola  vertex  is  placed  at  i  =  180.  To  ensure  that 
no  left  and  right  edge  effects,  it  was  necessary  to  choose  the  mesh  width  to  be  2.5 
times  larger  them  the  length.  That  is,  the  mesh  is  200  by  500  mesh  imits.  Figure 
4a  shows  the  initial  pulse  on  a  50  by  50  point  grid,  where  every  fourth  point  in  x 
and  every  tenth  point  in  y  is  shown.  The  pulse  peak  first  encounters  the  rim  of  the 
parabola  at  x  =  80  after  travelling  40  spatial  steps,  or,  since  r  =  0.5,  in  80  time 
steps. 
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Figure  4b  shows  the  beginning  of  the  reflection  at  the  reflector  rim  at  t  =  100. 
Note  that  the  reflected  wave  is  negative,  as  expected.  For  time  samples  t  =  200, 
300,  400,  and  500  the  wave  continues  to  reflect  and  then  focus  at  the  focal  point, 
as  shown  in  Figures  4c-4f.  The  pulse  peak  passes  the  focus  after  480  time  steps.  It 
is  clear  in  Figure  4f,  with  greater  amplitude  scale  than  the  previous  time  samples 
that  the  wave  is  focusing  with  greatest  intensity  at  the  parabola  focus. 

For  the  second  case,  a  cylindrical  wave  is  specified  for  a  transmitted  field.  Using 
the  asymptotic  approximation  for  the  Hankel  function  given  above — along  with  a 
Gaussian  amplitude  envelope  in  radius — gives  the  transient  pulse  initial  conditions: 


E  =  z- 


vWo 


(8) 


and  for  magnetic  field: 


H  =  <l>- . 7==^- 

^y/plpo 


(9) 


Note  that  these  initial  conditions  are  approximations  to  an  actual  cylindrical  wave 
at  a  given  time  sample,  but  that  these  initial  conditions  do  not  have  to  be  exact 
for  a  cylindrical  wave  to  propagate.  Once  the  algorithm  starts,  the  laws  of  wave 
propagation  are  strictly  followed.  Any  imperfections  in  the  initial  conditions  will 
lead  to  an  inward  travelling  wave,  or  a  change  in  the  outward  cylindrical  wave  shape. 

Figure  5a  shows  the  initial  wave  after  50  time  steps,  before  it  has  encoimtered 
the  reflector.  The  mesh  is  500  by  500  points,  with  every  tenth  value  displayed, 
and  with  po  =  55  and  w  —  15.  The  wave  spreads,  reflects  off  the  parabola,  and 
forms  a  plane  wave,  as  indicated  in  Figures  5b-5d,  corresponding  to  time  samples 
at  t  =  150,  350,  and  550.  The  reflected  wave  is  quite  well-formed  with  the  expected 
amplitude  taper  towards  the  edges,  but  with  perfectly  uniform  phase.  In  Figure 
5c,  the  peak  of  the  reflected  wave  has  just  past  the  plane  of  the  reflector  rim.  It 
is  a  reia*^ively  level  planar  phase  front.  In  the  next  Figure,  5d,  the  plane  wave  is 
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beginning  to  spread,  and  its  amplitude  has  more  of  a  taper.  Also  visible  in  Figme 
5d  is  the  absorbing  nature  of  the  back  boundeiry.  The  advancing  cylindrical  wave 
appears  unaffected  by  the  end  of  the  mesh. 

The  third  case  studied  is  a  high  frequency  modulated  Gaussian  pulse.  The 
initial  pulse  condition  is  the  same  as  in  case  1,  Equations  (6,7)  with  a  multiplication 
by  cos  kx.  Figures  6a-6f  show  the  time  evolution  across  a  500  by  500  point  mesh, 
with  every  tenth  value  displayed.  Figure  6a  shows  the  eight  high  frequency  wave 
peaks  within  the  Gaussian  pulse,  just  as  it  encounters  the  parabola  at  <  =  200.  With 
this  choice  of  carrier  wavelength,  the  reflector  has  a  width  of  50A.  The  reflections 
from  the  parabola  are  quite  visible  in  Figure  6b  at  t  =  300.  In  Figme  6d,  at 
t  —  480,  the  pulse  peak  is  at  the  focus,  with  amplitude  of  about  5.5.  The  focal 
point  is  clearly  visible,  highly  resolved  with  this  high  frequency  signal.  Figures  6e 
and  6f  show  the  wave  advancing  past  the  focal  point,  decreasing  in  amplitude  and 
spreading  transversely  while  retaining  its  modidation  in  the  propagation  direction. 


CONCLUSIONS 

An  analysis  has  been  made  of  the  reflection  characteristics  of  parabolic  reflec¬ 
tors  to  short  pulses.  Contrary  to  published  theory,  the  parabola  does  not  produce 
any  rmexpected  trzmsient  effects  beyond  the  standard  dispersive  effects  of  free-space 
propagation  of  a  short  pulse. 

The  FDTD  method  was  used  to  analyze  the  transient  behavior  of  the  two- 
dimensional  parabolic  reflector.  In  all  three  of  the  numerical  cases  considered,  the 
wave  behaved  as  expected,  focusing  to  high  intensity  at  the  parabola  focal  point,  or 
forming  a  planar  phase  front.  The  reflector  seems  to  have  no  differential  effects  on 
the  va'-ious  time  levels  of  the  pulse — the  reflection  of  the  initial,  leading  edge  of  the 
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pulse  joins  correctly  with  the  subsequently  reflected  trailing  portions  of  the  pulse. 

There  will  be  the  same  transient  behavior  of  a  wave  pulse  for  parabolic  re¬ 
flection  as  for  propagation  in  free  space,  but  the  parabola  itself  does  not  cause 
any  transient  non-uniformities  in  phase.  It  was  shown  that  since  each  frequency 
component  of  the  focal  source  signal  is  imaged  to  a  planar  source  one  focal  distance 
behind  the  reflector,  the  entire  spectrum  of  frequency  components  in  the  short  pulse 
is  imaged  to  that  plane,  and  the  parabola  can  be  replaced  by  that  planar  source 
(with  appropriate  amplitude  weighting)  illuminating  an  aperture  in  an  conducting 
screen. 

For  defocused  sources,  which  give  scanned  beams,  the  reflection  properties  of 
the  parabola  are  quite  frequency  dependent.  In  this  case,  there  are  geometric  path 
length  errors  for  every  incident  ray.  For  each  frequency  component  the  physical 
error  corresponds  to  a  different  electrical  phase  error.  The  short  time  pulse  will 
reflect  very  differently  from  a  constant  frequency  source,  and  the  transient  antenna 
response  will  be  entirely  different  from  a  tilted  plane  wave  illuminating  an  equivalent 
aperture. 
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t  <  1,  t  >  t, 

Illuminated  Region  of 
Paraboloidal  Reflectar 


Figure  2;  Ray  tracing  and  wave  reflection  by  a  parabola. 
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reflector  on  a  500  by  500  point  grid 


Figure  6  (continued) 


20-20 


THERMODYNAMIC  AND  TRANSPORT  PROPERTIES  OF  MOLECULAR  IONS 


FINAL  REPORT 


PREPARED  BY:  MURTY  A.  AKUNDI,  Ph.D 

ACADEMIC  RANK:  PROFESSOR 

UNIVERSITY  AND  DEPARTMENT:  XAVIER  UNIVERSITY 

PHYSICS/ENGINEERING 
NEW  ORLEANS,  LA  70125 

AFSOR/AEDC/CALSPAN 

DIVISION:  COMPUTATIONAL  FLUID 

DYNAMICS 

FOCAL  POINT:  JAMES  T.  CURTIS  (CALSPAN) 

DATE  SUBMITTED:  AUGUST  3, 1991 


EMPLOYEE  NUMBER:  2 


21-1 


ABSTRACT 


Thermodynamic  properties,  enthalpy,  free  energy  and  heat  capacity  of 
six  ionized  species  (CO+,  Ci,  H20^,  H3",  HsO"",  and  HCO^)  are  calculated 
using  spectroscopic  data.  These  thermodynamic  properties  are  a  function  of 
temperature  and  were  calculated  to  a  temperature  of  20,000  K.  Results  are 
compared  with  the  values  obtained  from  earlier  workers. 

INTRODUCTION 


Currently  envisaged  t'  ransatmospheric  and  aeroassit  missions  have 
created  a  resurgence  of  interest  in  the  aerothermic  design  of  hypersonic 
vehicles.  However,  the  velocities  and  altitudes  at  which  the  aircraft  would 
operate  are  unfamiliar  and  quite  different  and  severe.  As  a  result,  the  non¬ 
equilibrium  flow  environment  which  will  surround  these  vehicles  will 
considerably  impact  the  vehicle  aerodynamics,  thermal  loads,  and  propulsion 
system  efficiency.  The  design  of  future  vehicles  suitable  to  these  conditions, 
requires  careful  physical  modeling  to  simulate  these  phenomena. 

Under  hypersonic  flight  conditions,  a  vehicle  traveling  through  the 
atmosphere  will  excite  the  air,  which  flows  around  the  body,  to  very  high 
temperatures  as  the  kinetic  energy  from  the  vehicle  is  transferred.  This  will 
affect  the  properties  of  air  and  cause  deviation  from  those  of  a  thermally  and 
calorically  perfect  gas.  Under  these  conditions,  the  thermodynamic  and 
transport  properties  of  the  mixture  becomes  a  function  of  pressure  as  well  as 
temperature.  It  is  essential  to  evaluate  these  functions  in  order  to  calculate 
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the  pattern  of  air  flow  about  high  speed  vehicles,  the  viscous  and  pressure 
forces  which  result,  and  the  heat  flux  which  occurs  between  the  air  and  the 
vehicle.  Thermodynamic  and  transport  properties  of  several  species  in  air 
have  been  calculated  by  Gupta,  etal.,  (1989)  to  very  high  temperatures. 

In  addition  entry  probe  heat  shields  are  often  made  of  carbonaceous 
material  such  as  carbon-phenolic  abalator,  a  mechanism  used  for  reducing  the 
intensive  radiative  heating  encountered  during  the  entry  into  the  upper 
atmosphere  (Moss,  etal.  1975, 1976).  These  heat  shields  typically  contain  92% 
carbon,  6%  oxygen,  and  2%  hydrogen  by  mass  (Moss,  et  al.  1976).  This 
ablation  injects  gaseous  carbonaceous  materials  which  drastically  alter  the 
atmospheric  composition  near  the  entry  vehicle,  which  in  turn  changes  the 
thermodynamic  and  transport  properties.  Thus,  enthalpy,  free  energy, 
heating,  diffusion,  and  viscous  flow  rates  are  changed. 

Biolsi  (1978),  Rainwater,  etal.  (1981),  and  Biolsi,etal.  (1981)  investigated 
the  transport  properties  of  monoatomic  carbon  as  well  as  a  mixture  of  the 
ablation  products  C,  C2,  and  C3.  In  this  report  we  present  the  thermodynamic 
properties  of  CO^,  C2',  H20^,  H3*,  H30"’,  and  HCO""  that  are  computed  using 
a  simple  computer  program  developed  at  this  laboratory.  A  general 
discussion  on  the  method  to  calculate  the  diffusion,  viscosity,  and  thermal 
conductivity  from  collision  integrals  using  H-H  potentials  is  provided.  Due  to 
the  non-availability  of  a  computer  program,  transport  properties  of  these 
molecules  could  not  be  calculated.  The  future  plan  is  to  develop  a  suitable 
computer  program  to  evaluate  the  collision  integrals  and  calculate  the 
transport  properties. 


21-3 


PROBLEM  DISCUSSION 


Entry  probe  heat  shields,  at  high  temperatures,  inject  ablative  gaseous 
products  such  as  ionized  carbon  monoxide,  carbon,  water,  and  several  other 
combinations.  Data  on  the  thermodynamic  and  transport  properties  of  these 
species  at  high  temperatures  (>6,000  K)  is  not  available  in  the  literaiare. 
Determination  of  these  properties  is  vital  to  understand  the  air  flow  patterns 
whic^  in  turn  aid  in  the  thermodynamic  design  of  the  hypersonic  vehicles. 

Thermodynamic  Properties: 

The  equilibrium  thermodynamic  properties  of  gas  can  be  calculated  with 
certainty  provided  the  energy  levels  of  the  gas  particles  and  the  degeneracy  of 
these  levels  are  known.  For  most  of  the  diatomic  and  for  some  of  the 
ployatomic  gaseous  species,  this  information  can  be  deduced  from 
spectroscopic  data.  All  of  the  thermodynamic  properties,  I.eat  capacity  (Cp), 
enthalpy  {H°),  free  energy  (G°),  and  entropy  (S’)  of  the  species  can  be 
determined  from  a  knowledge  of  their  partition  functions  (Q).  Consequently, 
the  first  step  is  to  determine  the  partition  functions  of  these  species,  which  is 
defined  as  (Herzbert,  1945,  McBride,  et  al.,  1963) 

Q  =  ^  g^exp-  EUKgT  (1) 
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where  Ej  is  the  energy  of  the  state  of  the  partide  and  gi  is  the  degeneraqr. 
The  energy  may  be  due  to  the  ^nsiational,  vibrational,  or  rotational  motion 
of  the  partide  or  due  to  the  motion  of  electrons  within  the  partide.  Then  the 
partition  function  given  in  Eq.  (1)  can  be  expressed  as  a  product 


t  s?  or  c 
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where  Qt,  Qe,  Ov,  Qr*  Qa#  Qw#  and  Qc  are  the  translational,  electronic, 
vibrational,  rotational,  rotational  stretching,  Fermi  resonance  and  both 
anhormonicity  and  vibration  rotation  interaction,  respectively.  These 
individual  partition  functions  can  be  expressed  as  functions  of  temperature  as 
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(8) 


As  can  be  seen  from  Eqs.  (10)  to  (13),  the  translational  motion  of  the 
particle  contributes  only  to  the  free  energy  of  the  particle.  In  the  calculations 
made  in  this  report,  the  contributions  due  to  rotational  stretching  and  Fermi 
resonance  are  not  considered.  Details  of  the  calculations  and  the  values 
obtained  for  these  properties  are  presented  in  the  results  section. 
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Transport  Properties: 


Transport  properties  of  gases  may  be  evaluated  according  to  the  well 
known  Chapman-Enskog  solution  of  the  Boltzman  equation  (Herschfelder,  et 
al..  1954)  if  tn^  idtermolecular  potential  is  known.  These  intermolecular 
potentials  can  be  easily  evaluated  for  diatomic  molecules  but  it  is  more 
involved  for  polyatomic  molecules.  Within  the  first  Enskog  approximation, 
the  self  diffusion  coefficient  of  a  gas  is  inversely  proportional  to  Q  (1,  1)* 
integral  and  the  shear  viscosity  and  thermal  conductivity  are  inversely 
proportional  to  Q  (2,  2)*  integral.  The  general  form  of  the  reduced  collision 
integral  Q  {e,  s)*  is  represented  as 
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where  Qs  is  a  constant  (Cn  =  1,  €2,2  =  i)>  Kb  'S  Boltzman's  constant,  T  the 
temperature,  E  and  b  the  relative  total  energy  and  impact  parameter  and  X 
the  scattering  angle  given  by 
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=  expl—2a(rfd  —  l)l—2a[p(—atrfd  —  iil 


where 
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After  obtaining  the  numerical  values  of  Q  (1,1)  and  Q  {2.1)  by  evaluating 
the  collision  integrals,  the  transport  properties  can  be  calculated  usmg  the 
following  equations 


Viscosity: 
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Thermal  Conductivity. 
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Diffusion: 
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(19) 


The  problem  at  hand  is  to  develop  numerical  integral  methods  to 
evaluate  the  collision  integrals.  Recently  Rainwater,  etc.  (1982)  have 
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developed  a  computer  program  to  evaluate  these  integrals.  At  present  are 
in  the  process  of  obtaining  this  program  and  modifying  it  for  our  conditions. 
The  symbols  and  their  meaning  of  the  various  parameters  used  in  the 
equations  are  presented  in  the  Appendix. 

RESULTS 

Literature  survey  has  been  made  and  relevant  molecular  parameters  are 
collected  for  the  six  species  to  compute  the  thermodynamic  properties.  In  the 
case  of  molecules  for  which  the  needed  data  is  not  available,  the  necessary 
parameters  (dissociation  energies,  force  constants,  and  excited  state 
electronic  energies)  are  computed  using  appropriate  equations  given  by 
Hertzberg.  Heat  capacity,  enthalpy,  and  free  energy  are  calculated  using  the 
computer  program  developed  earlier  in  this  laboratory  by  Dr.  James  T.  Curtis. 
Table  I  presents  the  molecular  constants  that  are  used  in  calculating  the 
partition  function.  Tables  II  to  Vll  present  the  thermodynamic  properties  of 
the  six  species  to  a  temperature  of  20,000  K.  These  data  have  been  plotted 
and  compared  with  the  data  m  JANNAF,  and  shown  m  Figs.  1  to  3.  Though 
thermodynamic  data  for  some  of  these  molecules  (C2 ,  H3",  and  HCO*)  are 
calculated  by  earlier  workers  (JANNAF,  1974,  McBride,  1967),  they  are  only 
computed  up  to  6,000  K.  In  the  case  of  HCO"  and  HsO’’,  the  vibrational 
frequencies  are  calculated  using  the  spectroscopic  data  of  isoelectronic  species 
NH3  and  HCN.  In  the  present  report,  spectroscopic  data  collected  is  more 
recent  and  accurate  and  is  based  on  experimental  work.  In  addition,  since  the 
thermodynamic  properties  are  calculated  to  high  temperatures,  this  data  will 
be  more  useful  to  understand  the  non-equilibrium  air  flow  conditions  and  to 


21-9 


a  precise  aerothemnic  design  of  the  space  vehicles.  In  h’30*  and  HCO",  only 
ground  state  electron  energy  is  used  in  evaluating  the  electronic  partition 
function,  since  the  excited  state  energy  values  are  not  available  at  the  present 
time.  For  CO*  and  C2*,  all  partition  function  contributions  (except  Qp  and  Qw) 
are  used  to  evaluate  the  thermodynamic  properties.  On  the  other  hand,  in 
the  case  of  polyatomic  molecules  H2O  +  ,  HCO’',  H3*.  and  H3O*  only 
contributions  of  electronic,  rotational,  and  vibrational  partitions  function  are 
considered,  since  complete  data  on  anharmonic  constants  and  rotation 
vibration  interaction  terms  on  the  species  is  not  available  at  the  present  time. 

CONCLUSIONS 

Heat  capacity,  enthalpy,  and  free  energy  of  CO”,  C2',  H2O”,  H3”,  and 
H3O*  have  been  calculated  up  to  a  temperature  of  20,000  K.  Vibration 
rotation  interaction  and  anharmonicity  influence  on  thermodynamic 
properties  is  significant  at  high  temperatures.  In  the  case  of  CO  *  and  C2 ,  this 
effect  is  included  in  our  calculations.  Unfortunately,  we  could  not  study  this 
effect  in  the  remaining  molecules  due  to  insufficient  data.  There  is  a  need  to 
improve  this  data  and  our  future  work  will  focus  on  improving  the  current 
computer  program  to  include  the  affect  of  anharmonicity  and  vibration 
rotation  interaction.  Transport  properties  of  these  six  species  could  not  be 
computed  due  to  the  non-availability  of  a  suitable  computer  program  and 
due  to  the  short  time  I  have  spent  here.  The  current  plan  is  to  keep  in  contact 
with  the  CFD  group  and  continue  the  work  during  the  academic  year. 
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Symmetry  no. 2  is  used  in  these  calculations 
listed  as  A 
listed  as  E 
Listed  as  ZPE 
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APPENDIX  A 
SYMBOLS 


Ag,  3e,  Ce 


A©,  3o,  Co 
Q® 

C 

C2 

Do,  Dqo 
Do“ 

di 

Ft" 

iY-Ho" 

9e 

gii 

Hj? 

Ho" 

Ht"-Ho“ 

AHt" 

h 

J 

Kb 

M 


p 

Q 

Qa 

Qe 

Qm 

Qr 

Qrv 

Qv 

Q« 

R 


'’e 

T 

To 


Rotational  constants  corresponding  to  eguilibrium  separation  of 
atoms,  cm-^. 

Rotationcc  constants  for  lowest  vibrational  states,  an-^. 

Heat  capacity  at  constant  pressure  of  standard  state  cai/mole.  K. 
Velocity  of  light,  2.997925  X 10  'S®cm. 

Second  radiation  constant,  hdK,  1.43880  (cm)  (deg). 

Spectroscopic  constant,  for  rotational  stretching,  cm-i . 

Rotational  stretching  constant  for  lowest  vibrational  state,  cm-'. 
Dissociation  energy  measured  from  the  lowest  vibrational  energy 
level  Cv=0). 

Degeneracy  assodated  with  Vi. 

Gibbs  free  energy  for  standard  state,  cal/mole. 

Sensible  energy  for  standard  state,  cai/mole. 

Electronic  statistical  weight. 

Anharmonidty  constant  for  doubly  degenerate  vibrations  in  linear 
molecules,  cm- 1 

Sum  of  sensible  enthalpy  and  chemical  energy  at  0  K  for  standard 
state,  cal/mole. 

Chemical  energy  at  0  K  for  standard  state,  cal/mole. 

Sensible  enthalpy  for  standard  state,  cal/mole. 

Enthalpy  change  for  formation  of  substance  for  elements  in  atomic 
gas  state,  cal/mole. 

Planack's  constant,  6.6261  X 10-27  (erg)  (sec). 

Principal  moments  of  inertia,  (g)  {cm2)  (mole). 

Rotational  quantum  number. 

Boltzmann  constant,  1.38044  X  10-16  erg/deg. 

Molecular  weight. 

Number  of  unique  frequencies. 

Partial  pressure,  atm. 

Internal  partition  function. 

Anharmonic  correction  factor  for  partition  function. 

Electronic  partition  function. 

Internal  partition  function  for  mih  electronic  state. 

Classic  rotational  partition  function. 

Vibration-rotation  interaction  correction  for  partition  function. 
Harmonic-oscillator  partition  function  due  vibrational  motion. 
Rotational-stretching  correction  faaorto  partition  function. 
Universal  gas  constant,  1.9876  cal/mole  K. 

Internuclear  distance,  Ao. 

Equilibrium  internuclear  distance,  Ao. 

Temperature  K. 

Electronic  excitation  energy  between  lowest  vibrational  states 
(V  =  0)  of  ground  and  excited  state,  cm-i 
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Ui 

oe,ai 

ojA,  cj3,  tyC 


Vi 

p 

a 


oe 


Vibrational  quantum  number. 

Vibration  rotation  interaction  constants  for  diatomic  molecules, 
an-K 

Vibration  rotation  interaction  constants  for  polyatomic  molecules, 
cm-^. 

Observed  fundamental  free  energy,  cm-^. 

Rotational  stretching  spectroscopic  constant,  K-^ . 

Symmetry  number. 

Zero-order  vibrational  frequency  for  diatomic  molecules,  cm-* . 
Anharmonicity  constants  for  diatomic  molecules,  cm*^ . 
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Thermodynamic  Properties  of  HCOr 
Table"  II 


T 

(  K  ) 

O.lOOOOOE+03 

0.600000E+03 

O.llOOOOE+04 

0.160000E+04 

0.210000E+04 

0.260000E+04 

0.310000E+04 

0.360000E+04 

0.410000E+04 

0.460000E+04 

0.510000E+04 

0.560000E+04 

0.610000E+04 

0.660000E+04 

0.710000E+04 

0.760000E+04 

0.810000E+04 

0.860000E+04 

0.910000E+04 

0.960000E+04 

O.lOlOOOE+05 

0.106000E+05 

O.lllOOOE+05 

0.116000E+05 

0.121000E+05 

0.126000E+05 

0.131000E+05 

0.136000E+05 

0.141000E+05 

0.146000E+05 

0.151000E+05 

0.156000E+05 

0.161000E+05 

0.166000E+05 

0.171000E+05 

0.176000E+05 

0.181000E+05 

0.186000E+05 

0.191000E-r05 

0.196000E+05 


H-BR 

(  j/lDOl  ) 

-0.633885E+04 
0.1i521iE+05 
0.351483E+05 
0.619353E+05 
0 . 903996E+05 
0. 11977 4E+06 
0- 14967 9E+06 
0.178915E+06 
0.210369E+06 
0. 24097 4E+06 
0.271689E+06 
0.302484E+06 
0.333341E+06 
0.364245E+06 
0.395188E+06 
0.426161E+06 
0.457159E+06 
0.488178E+06 
0.519215E+06 
0.550266E+06 
0.581330E+06 
0.612404E+06 
0,643488E+06 
0.674580E+06 
0.705679E+06 
0.736784E+06 
0.767895E+06 
0.799011E+06 
0.830132E+06 
0.861256E+06 
0.892384E+06 
0.923515E+06 
0.954649E+06 
0.985786E+06 
0.101693E+07 
0.104807E+07 
0.107921E+07 
0.111036E+07 
0.114150E+07 
0.117265E+07 


-(G-BR>/T 
(  j/mol  ) 

0.245910E+04 

0.243667E+04 

0.245230E+04 

0.246557E+04 

0.247669E+04 

0.248622E+04 

0.249452E+04 

0.250187E+04 

0.250845E+04 

0.251442E+04 

0.251987E+04 

0.252489E+04 

0.252954E+04 

0.253386E+04 

0.25379iE+04 

0.254171E+04 

0.254530E+04 

0.254869E+04 

0.255190E+04 

0.255496E+04 

0.255788E+04 

0.256067E+04 

0.256333E+04 

0.256589E+04 

0.256835E+04 

0.257071E+04 

0.257299E+04 

0.257519E+04 

0.257731E+04 

0.257937E+04 

0.258136E+04 

0.258328E+04 

0.258515E+04 

0.258697E+04 

0.258873E+04 

0.259045E+04 

0.259211E+04 

0.259374E+04 

0.259532E+04 

0.259687E+04 


CP 

(j/mol) 

0.291158E+02 
0.426881E-f02 
0.510540E+02 
0.556035E+02 
0.580111E+02 
0.593672E+02 
0.601885E+02 
0.607 17 9E+02 
0.610772E+02 
0.613312E+02 
0.615172E+02 
0.616572E+02 
0.617651E+02 
0.618500E+02 
0. 61917 9E+02 
0.619731E+02 
0. 62018 6E+02 
0.620565E+02 
0.620884E+02 
0.621155E+02 
0.621387E+02 
0.621587E+02 
0.621761E+02 
0.621913E+02 
0.622047E+02 
0.622165E+02 
0.622271E+02 
0.622364E+02 
0.622448E+02 
0.622524E+02 
0.622592E+02 
0.622654E+02 
0.622710E+02 
0.622762E+02 
0.622808E+02 
0.622851E+02 
0.622891E+02 
0.622927E+02 
0.622960E+02 
0.622991E+02 


21-22 


Thermodynamic  Properties  of  H30+ 
Table  III 


T  H-HR  -(G-HR)/T  CP 

(  K  )  (  J/mol)  (  J/mole)  (J/mole) 


O.lOOOOOE+03 

0.600000E+03 

6.110000E+04 

0.160000E+04 

0.210000E+04 

0.260000E+04 

0.310000E+04 

0.360000E+04 

0.410000E+04 

0.460000E+04 

0.510000E+04 

0.560000E+04 

0.610000E+04 

0.660000E+04 

0.710000E+04 

0.760000E+04 

0.810000E+04 

0.860000E+04 

0.910000E+04 

0.960000E+04 

O.lOlOOOE+05 

0.106000E+05 

O.lllOOOE+05 

0.116000E+05 

0.121000E+05 

0.126000E+05 

0.131000E+05 

0.136000E+05 

0.141000E+05 

0.146000E+05 

0.151000E+05 

0.156000E+05 

0.161000E+05 

0.166000E+05 

0.171000E+05 

0.176000E+05 

0.181000E+05 

0.186000E+05 

0.191000E+05 

0.196000E+05 


-0.683424E-I-04 

0.124112E+05 

0.374413E+05 

0-676166S+05 

0.101552K+06 

0.1378392+06 

0.175593Ef06 

0.214299E+06 

0.253650E+06 

0.293454E+06 

0.333588E+06 

0.373967E+06 

0.414536E+06 

0.455252E+06 

0.496085E+06 

0.537014E+06 

0.578020E+06 

0.619091E+06 

0.660216E+06 

0.701387E+06 

0.742598E+06 

0.783842E+06 

0.825115E+06 

0.866414E+06 

0.907735E+06 

0.949075E+06 

0.990434E+06 

0.103181E+07 

0.107320E+07 

0.111460E+07 

0.115601E+07 

0.119743E+07 

0.123886E+07 

0.128030E+07 

0.132175E+07 

0.136320E+07 

0.140466E+07 

0.144613E+07 

0.148760E+07 

0.152908E+07 


0.212649E+03 

0.189355E+03 

0.206020E+03 

0.220302E+03 

0.232624E+03 

0.243453E+03 

0.253100E+03 

0.261788E+03 

0.269684E+03 

0-276915E+03 

0.283582E+03 

0.289764E+03 

0.295526E+03 

0.300920E+03 

0.305990E+03 

0.310772E+03 

0.315297E+03 

0.319590E+03 

0.323675E+03 

0.327569E+03 

0.331290E+03 

0.334853E+03 

0.338271E+03 

0.341554E+03 

0.344713E+03 

0.347756E+03 

0.350693E+03 

0.353530E+03 

0.356274E+03 

0.358930E+03 

0.361504E+03 

0.364002E+03 

0.366426E+03 

0.368782E+03 

0.371074E+03 

0.373304E+03 

0.375475E+03 

0.377592E+03 

0.379656E+03 

0.381671E+03 


0.332608E+02 
0.447733E+02 
0.554770E+02 
0.646675E+02 
0.705981E+02 
0.742606E+02 
0.765901E+02 
0.781363E+02 
0.792050E+02 
0.799705E+02 
0.805358E+02 
0.809641E+02 
0.812960E+02 
0.815580E+02 
0.817685E+02 
0.819399E+02 
0.820813E+02 
0.821993E+02 
0.822987E+02 
0.823833E+02 
0.824559E+02 
0.825185E+02 
0.825730E+02 
0.826207E+02 
0.826627E+02 
0.826998E+02 
0.827328E+02 
0.827622E+02 
0.827886E+02 
0. 82812 4E+02 
0.828338E+02 
0.828533E+02 
0.828709E+02 
0.828870E+02 
0.829017E+02 
0.829152E+02 
0.829276E+02 
0.829390E+02 
0.829496E+02 
0.829593E+02 
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Thermodynamic  Properties  of  C2- 
Table  IV 


T 

{  K  ) 


O.lOOOOOE+03 

0.600000E+03 

O.llOOOOE+04 

0.160000E+04 

0.210000E+04 

0.260000E+04 

0.310000E+04 

0.360000E+04 

0.410000E+04 

0.460000E+04 

0.510000E+04 

0.560000E+04 

0.610000E+04 

0,660000E+04 

0.710000E+04 

0.760000E+04 

0.810000E+04 

0.860000E+04 

0.910000E+04 

0.960000E+04 

O.lOlOOOE+05 

0.106000E+05 

O.lllOOOE+05 

0.116000E+05 

0.121000E+05 

0.126000E+05 

0.131000E+05 

0.136000E+05 

0.141000E+05 

0.146000E+05 

0.151000E+05 

0.156000E+05 

0.161000E+05 

0.166000E+05 

0.171000E+05 

0.176000E+05 

0.181000E+05 

0.186000E+05 

0.191000E+05 

0.196000E+05 


H-HR 

(J/mol) 


0.190333E+04 
0.167883E+05 
0.339465E+05 
0.537381E+05 
0.750204E+05 
0.966832E+05 
0-118212E+06 
0. 13947 6E+06 
0.160498E+06 
0. 18133 9E+06 
0.202062E+06 
0.222713E+06 
0.243328E+06 
0.263930E+06 
0.284533E+06 
0.305146E+06 
0.325776E+06 
0.346425E-f-06 
0.367096E+06 
0.387788E+06 
0.408502E+06 
0.429238E+06 
0.449997E+06 
0.470779E+06 
0.491583E+06 
0.512410E+06 
0.533261E+06 
0.554135E+06 
0.575034E+06 
0.595957E+06 
0.616907E+06 
0.637882E+06 
0.658884E+06 
0.679914E+06 
0.700972E+06 
0.722059E+06 
0.743176E+06 
0.764324E+06 
0.785502E+06 
0.806713E+06 


-(G“HR> /T 
(J/ffiOl) 


0-145637E+03 

0.189525E+03 

0.207272E+03 

0.219326E+03 

0.228753E+03 

0.236544E+03 

0.243172E+03 

0.248922E+03 

0.253988E+03 

0.258509E+03 

0.262587E+03 

0.26e300E+03 

0.260706E+03 

0.272853E+03 

0.275776E+03 

0.278506E+03 

0.281066E+03 

0.283477E+03 

0.285755E+03 

0.287915E+03 

0.289967E+03 

0.291922E+03 

0.293790E+03 

0.295577E+03 

0.297291E+03 

0.298936E+03 

0.300520E+03 

0,302045E+03 

0.303517E+03 

0.304939E+03 

0.306314E+03 

0.307645E+03 

0.308936E+03 

0.310188E+03 

0.311404E+03 

0.312586E+03 

0.313736E+03 

0.314855E+03 

0.315945E+03 

0.317009E+03 


CP 

(J/mol) 


0.291055E+02 
0.315195E+02 
0.371805E+02 
0.415357E+02 
0.432107E+02 
0.432796E+02 
0.428002E+02 
0.422 67 lE+02 
0.418408E+02 
0.415455E+02 
0.413606E+02 
0.412573E+02 
0.412105E+02 
0.412006E+02 
0.412142E+02 
0.412421E+02 
0.412782E+02 
0.413189E+02 
0. 41362 OE+02 
0.414061E+02 
0.414507E+02 
0.414955E+02 
0.415404E+02 
0.415857E+02 
0.416314E+02 
0.416777E+02 
0.417248E+02 
0.417729E+02 
0.418221E+02 
0.418725E+02 
0.419242E+02 
0.419773E+02 
0.420317E+02 
0.420877E+02 
0.421450E+02 
0.422038E+02 
0.422641E+02 
0.423258E+02 
0.423889E+02 
0.424533E+02 
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Thermodynamic  Properties  of  C0+ 
Table  V 


T 

(  K  ) 


O.lOOOOOE+03 

0.600000E+03 

O.llOOOOE+04 

0.160000E+04 

0.210000E-(-04 

0.260000E+04 

0.310d00E+04 

0.360000E+04 

0.410000E+04 

0.460000E+04 

0.510000E+04 

0.560000E+04 

0.610000E+04 

0.660000E+04 

0.710000E+04 

0.760000E+04 

0.810000E+04 

0.860000E+04 

0.910000E+04 

0.960000E+04 

O.lOlOOOE+05 

0.106000E+05 

O.lllOOOE+05 

0.116000E+05 

0.121000E+05 

0.126000E+05 

0.131000E+05 

0.136000E+05 

0.141000E+05 

0.146000E+05 

0.151000E+05 

0. 156000E+05 

0..161000E+05 

0.166000E+05 

0.171000E+05 

0.176000E+05 

0.181000E+05 

0.186000E+05 

0.191000E+05 

0.196000E+05 


H-HR 

(J/mol) 


“0.578048E+04 
0.891734E+04 
0.249584E-i-05 
0.422 63 9E+05 
0.602218E+05 
0.785381E+05 
0.970945E+05 
0.115864E+06 
0.134870E+06 
0.154159E+06 
0.173778E+06 
0.193761E+06 
0. 21412 6E+06 
0.234872E+06 
0.255982E+06 
0. 27742 6E+06 
0.299169E+06 
0.321169E+06 
0.343386E+06 
0.365779E+06 
0,388310E+06 
0.410947E+06 
0,433658E+06 
0.456419E+06 
0.479207E+06 
0.502006E+06 
0.524800E+06 
0.547578E+06 
0.570332E+06 
0.593055E+06 
0.615741E+06 
0.638389E+06 
0.660995E+06 
0.683559E+06 
0.706082E+06 
0.728563E+06 
0.751005E+06 
0.773407E+06 
0.795774E+06 
0.818106E+06 


-(G-HR)  /T 
(J/mol) 


0.229123E+03 

0.208895E+03 

0.220413E+03 

0-229635E+03 

0.237134E+03 

0. 24342 6E+03 

0.248839E+03 

0.253587E+03 

0.257820E+03 

0.261640E+03 

0.265127E+03 

0.268338E+03 

0.271319E+03 

0.274103E+03 

0.276719E+03 

0.279188E+03 

0.281527E+03 

0.283752E+03 

0.285873E+03 

0.287901E+03 

0.289845E+03 

0.291710E+03 

0.293504E+03 

0.295232E+03 

0.296897E+03 

0.298506E+03 

0.300061E+03 

0.301565E+03 

0.303022E+03 

0,304435E+03 

0.305805E+03 

0.307136E+03 

0.308429E+03 

0.309687E+03 

0.310910E+03 

0.312102E+03 

0.313263E+03 

0.314395E+03 

0.315499E+03 

0.316576E+03 


CP 

(J/mol) 


0.291041E+02 

0.303617E+02 

0.336090E+02 

0.354052E+02 

0.363347E+02 

0.368935E+02 

0.373240E+02 

0.377620E+02 

0.382784E+02 

0.388935E+02 

0.395935E+02 

0.403463E+02 

0.411145E+02 

0.418633E+02 

0.425650E+02 

0.432000E+02 

0.437569E+02 

0.442310E+02 

0.446228E+02 

0.449364E+02 

0.451785E+02 

0.453569E+02 

0.454799E+02 

0.455558E+02 

0.455925E+02 

0.455971E+02 

0.455759E+02 

0.455347E+02 

0.454781E+02 

0.454103E+02 

0.453347E+02 

0.452542E+02 

0.451709E+02 

0.450869E+02 

0.450036E+02 

0.449222E+02 

0.448^  oE+02 

0.447685E+02 

0.446975E+02 

0.446310E+02 
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Thermodynamic  properties  of  H3+ 
Table  VI 


T 

(  K  ) 


O.lOOOOOE+03 

0.600000E+03 

O.llOOOOE+04 

0.160000E+04 

0.210000E+04 

0.260000E+04 

0.310000E+04 

0.360000E+04 

0.410000E+04 

0.460000E+04 

0.510000E+04 

0.560000E+04 

0.610000E+04 

0.660000E+04 

0.710000E+04 

0.760000E+04 

0.810000E+04 

0.860000E+04 

0.910000E+04 

0.960000E+04 

O.lOlOOOE+05 

0.106000E+05 

O.lllOOOE+05 

0.116000E+05 

0.121000Ef05 

0.126000E+05 

0.131000E+05 

0.136000E+05 

0.141000E+05 

0.146000E+05 

0.151000E+05 

0.156000E+05 

0.161000E+05 

0-166000E+05 

0.171000E+05 

0.176000E+05 

0. 181000E+05 

0. 186000E+05 

0. 191000E+05 

0.196000E+05 


H-HR 

(O'/mol) 


-0.658706E+04 
0.101333E+05 
0.286065E+05 
0.501565E+05 
0.741272E+05 
0.997774E+05 
0. 12665 6E+06 
0.154482E+06 
0.183054E+06 
0.212205E+06 
0.241796E+06 
0.271712E+06 
0.301857E+06 
0.332158E+06 
0.362556E+06 
0,393008E+06 
0,423482E+06 
0.453952E+06 
0.484404E+06 
0. 51482 4E+06 
0.545206E+06 
0.575543E+06 
0.605834E+06 
0.636076E+06 
0.666271E+06 
0.696417E+06 
0.726518E+06 
0.756573E+06 
0.786585E+06 
0.816556E+06 
0.846488E+06 
0.876382E+06 
0.906241E+06 
0.936066E+06 
0. 965859E+06 
0.995622E+06 
0.102536E+07 
0.105506E+07 
0.108475E+07 
0,111440E+07 


-  (G-HR)  /T 
(J/mol) 


0.195679E+03 

0-172686E+03 

0.185782E+03 

0.196522E+03 

0.205584E+03 

0.213452E+03 

0-220421E+03 

0.226685E+03 

0.232379E+03 

0.237603E+03 

0.242429E+03 

0.246916E+03 

0.251107E+03 

0.255039E+03 

0.258741E+03 

0.262238E+03 

0.265552E+03 

0.268698E+03 

0.271694E+03 

0.274552E+03 

0.277284E+03 

0.279900E+03 

0.282409E+03 

0.284820E+03 

0.287139E+03 

0.289372E+03 

0.291527E+03 

0.293608E+03 

0.295619E+03 

0.297565E+03 

0.299451E+03 

0.301279E+03 

0.303053E+03 

0.304776E+03 

0,306451E+03 

0.308080E+03 

0.309666E+03 

0.311211E+03 

0.312716E+03 

0.314185E+03 


CP 

(J/mol) 


0.332574E+02 

0.342431E+02 

0.400884E+02 

0. 45817 9E+02 

0.498143E+02 

0.526414E+02 

0.547812E+02 

0.564584E+02 

0.577733E+02 

0.587849E+02 

0.595417E+02 

0.600889E+02 

0.604681E+02 

0.607158E+02 

0.608 62 9E+02 

0.609346E+02 

0.609508E+02 

0.609269E+02 

0.608749E+02 

0.608037E+02 

0.607200E+02 

0.606287E+02 

0.605335E+02 

0.604369E+02 

0.603409E+02 

0.602467E+02 

0.601552E+02 

0.600671E+02 

0.599825E+02 

0.599018E+02 

0.598250E+02 

0.597520E+02 

0.596829E+02 

0.596174E+02 

0.595554E+02 

0.594969E+02 

0.594415E+02 

0.593892E+02 

0.593397E+02 

0.592929E+02 
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Thermodynamic  Properties  of  H20+ 
Table*  VII 


T 

(  K  ) 


O.lOOOOOE+03 

0.600000E+03 

O.llOOOOE+04 

0.160000E+04 

0.210000E+04 

0.260000E+04 

0.310000E+04 

0.360000E+04 

0.410000E+04 

0.460000E+04 

0.510000E+04 

0.560000E+04 

0.610000E+04 

0.660000E+04 

0.710000E+04 

0,760000E+04 

0,810000E+04 

0.860000E+04 

0.910000E+04 

0.960000E+04 

O.lOlOOOE+05 

0.106000E+05 

O.lllOOOE+05 

0.116000E+05 

0.121000E+05 

0.126000E+05 

0.131000E+05 

0.136000E+05 

0.141000E+05 

0.146000E+05 

0.151000E+05 

0.156000E+05 

0.161000E+05 

0.166000E+05 

0.171000E+05 

0.176000E+05 

0.181000E+05 

0.186000E+05 

0.191000E+05 

0.196000E+05 


H-HR 

(J/mol) 


-0.668326E+04 
0. 10491 4E+05 
0.306992E+05 
0. 54162 4E+05 
0.799952E+05 
0.10'f491E+06 
0.136150E+06 
0.165614E+06 
0.195631E+06 
0.226030E+06 
0.256691E+06 
0.287531E+06 
0.ri8489E+06 
0.349519E+06 
0.380586E+06 
0.411664E+06 
0,442731E+06 
0.473773E+06 
0.504778E+06 
0.535738E+06 
0.566646E+06 
0.597499E+06 
0.628294E+06 
0.659031E+06 
0.689708E+06 
0.720327E+06 
0.750888E+06 
0.781393E+06 
0.811843E+06 
0.842241E+06 
0.872588E+06 
0.902886’- 
0.933137E+Ut; 
0.963343E+06 
0.993506E+06 
0.102363E+07 
0.105371E+07 
0.108376E+07 
0.111377E+07 
0.114375E+07 


-  (G-HR)  /T 
(J/mol) 


0.149109E+03 

0.125531E+03 

0.139387E+03 

0.150961E+03 

0.160745E+03 

0.169230E+03 

0.176730E+03 

0.183454E+03 

0.189550E+03 

0.195123E+03 

0.200255E+03 

0.205011E+03 

0.209439E+03 

0.213582E+03 

0.217473E+03 

0.221141E+03 

0.224608E+03 

0.227895E+03 

0.231019E+03 

0.233995E+03 

0.236836E+03 

0.239553E+03 

0.242157E+03 

0.244655E+03 

0.247057E+03 

0.249368E+03 

0.251596E+03 

0.253746E+03 

0.255822E+03 

0.257831E+03 

0.259775E+03 

0.261659E+03 

0.263486E+03 

0.265260E+03 

0.266983E+03 

0.268658E+03 

0.270288E+03 

0.271876E+03 

0,273422E+03 

0.274929E+03 


CP 

(J/mol) 


0.332574E+02 
0. 36843 9E+02 
0.439456E+02 
0.495773E+02 
0. 53523 OE+02 
0.562955E+02 
0.582227E+02 
0.595497E+02 
0. 60462 9E+02 
0.610927E+02 
0.615251E+02 
0.618156E+02 
0.620010E+02 
0.621070E+02 
0.621522E+02 
0.621508E+02 
0. 62113 9E+02 
0.620502E+02 
0.619668E+02 
0.618693E+02 
0.617621E+02 
0.616488E+02 
0.615320E+02 
0.61^:.38E+02 
0.612S60E+02 
0.611796E+02 
0. 61065 6E+02 
0.609547E+02 
0.608473E+02 
0.607437E+02 
0.606440E+02 
0.605483E+02 
0.604567E+02 
0.603692E+02 
0.602855E+02 
0.602057E+02 
0. 601296E+02 
0.600570E+02 
0.599879E+02 
0.599220E+02 
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NON-INTRUSIVE  TESTING  OF  COMPOSITE 
AIRCRAFT  ENGINE  COMPONENTS:  I 


Laurence  J.  Jacobs,  Assistant  Professor 
Engineering  Science  and  Mechanics  Program 
Georgia  In':titute  of  Technology 


AITSTRACT 

Advanced,  high  temperature  conmosite  engine  components  will  add  new 
complications  to  established  testing  procedures  and  increase  the  need  for  innovative 
non-intrusive  technologies.  Lightweight  composite  materials  are  inhomogeneous  and 
anisotropic  in  nature;  these  conditions  complicate  any  potential  structural  model  and 
question  the  validity  of  surface,  dynamic  stress  measurements.  Laser  ultrasonics  has 
the  capability  to  provide  useful  engineering  information  about  the  structural  integrity 
of  an  in-service  engine  component  without  interfering  with  the  process  being 
monitored.  Since  elastic  waves  propagate  through  the  component  thickness,  laser 
ultrasonics  examines  the  condition  of  the  entire  specimen  and  not  just  a  limited 
number  of  surface  points.  Ultrasonics  is  a  powerful  tool  for  the  characterization  of 
structural  materials,  but  to  be  effective,  the  propagation  characteristics  of  the  elastic 
waves  themselves  must  be  understood.  This  report  concludes  with  a  review  of  the 
basic  aspects  of  wave  propagation  in  an  anisotropic  media  and  discusses  the  potential 
for  material  characterization  using  laser  ultrasonics. 


INTRODUCTION 


The  use  of  composite  materials  for  propulsion  systems  will  dramatically 
increase  the  need  for  new  technologies  for  engine  testing  and  certification.  There 
are  a  variety  of  potential  problems  if  methodologies  currently  in  use  for  "traditional" 
engine  materials  are  applied  to  these  new,  high  performance  materials.  The  purpose 
of  this  report  is  to  examine  the  potential  of  innovative,  non-intrusive  techniques  to 
monitor  the  structural  integrity  of  propulsion  components.  Of  primaiy  concern  is  the 
ability  of  the  methodology  to  measure  useful  engineering  values  which  can  be  related 
to  the  component’s  integrity.  Part  I  of  this  report  introduces  the  proposed  system, 
laser  ultrasonics,  and  discusses  in  depth,  its  application  to  a  potential  advanced 
composite  engine  material,  Avimid-N.  The  second  part  of  this  report  (authored  by 
graduate  student  James  Abbey)  summarizes  other  potential  nondestructive  evaluation 
techniques,  examines,  in  depth,  the  laser  generation  of  ultrasound  and  reviews  the 
candidate  composite  material  types. 

PROBLEM  STATEMENT  AND  PROPOSED  SOLUTION 

Lightweight,  high  temperature  composite  engine  materials  have  the  potential 
to  provide  significant  economic  and  performance  benefits.  There  are,  however, 
many  problems  associated  with  the  use  of  these  new  materials;  chief  among  them 
is  that  the  structural  behavior  of  a  composite  can  be  very  different  from  that  of  a 
metal  component.  A  composite’s  inherent  anisotropy  governs  both  its  static  and 
dynamic  structural  behavior.  This  anisotropy,  which  is  tailored  by  the  designer  to 
obtain  maximum  structural  performance,  must  be  properly  accounted  for  in  any 
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evaluation  procedure. 


The  current  engine  testing  technology,  which  is  very  well  established  and  has 
been  field  proven,  uses  dynamic  strain  gages  to  directly  obtaiu  measurements  of 
certain  critical  parts.  There  are  significant  potential  problems  concerning  the 
application  of  this  technique  for  composite  components.  The  primary  difficulty  is 
that  strain  gages  make  surface  measurements  at  a  limited  number  of  points;  these 
values  are  used,  with  the  appropriate  structural  analysis,  to  infer  the  state  of  stress 
throughout  the  component.  The  limited  life  of  the  gages,  their  associated  wire  paths 
and  the  expense  involved  in  their  installation  are  additional  disadvantages  for  their 
use  with  these  new  composite  materials.  The  effectiveness  of  a  surface  measurement 
of  a  composite  material  is  debatable  because  of  the  material’s  anisotropic  behavior 
[1],  Additionally,  detailed  reliable  analysis  packages  are  not  currently  available  for 
these  advanced  materials,  so  the  usefulness  of  a  limited  number  of  single  point  strain 
measurements  is  doubtful  for  composite  engine  components. 

Laser  ultrasonics  [2]  is  a  non-contact,  non-intrusive,  nondestructive  evaluation 
technique  that  uses  lasers  to  generate  and  detect  ultrasonic  waves.  The  physical 
principles  underlying  the  generation  and  reception  of  laser  ultrasound  in  metallic 
specimens  is  fairly  well  understood.  Briefly,  a  certain  amount  of  the  incident 
electromagnetic  energy  from  the  laser  is  absorbed  at  the  surface  of  the  specimen  and 
converted  into  thermal  energy.  The  nanosecond  time  scale  of  a  Q-switched  laser 
pulse  causes  the  thermal  wavefield  to  extend  only. a  few  millimeters  into  the 
specimen.  This  nearly  instantaneous  thermal  expansion  generates  elastic  waves 
which  propagate  into  the  specimen.  The  specimen’s  response  to  these  elastic  waves 
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is  measured  with  a  laser  interferometer.  This  optical  device  permits  the  high  fidelity 
localized  detection  of  ultrasonic  waves  without  disturbing  the  process  being 
monitored.  A  schematic  of  the  laser  ultrasonic  process  is  given  in  Figure  1.  Part  n 
of  this  report  provides  a  detailed  description  of  the  laser  generation  of  ultrasound 
and  its  application  to  high  temperature  composite  materials. 

The  main  advantages  of  laser  ultrasonics  for  the  non-intrusive  testing  of 
composite  engine  components  include: 

(1)  It  is  a  totally  non-contact  technique,  so  it  does  not  interfere  with  the 
process  being  monitored.  In  addition,  the  adverse  temperature  and  vibration 
environment  does  not  cause  excessive  difficulties.  There  is  also  no  need  for 
complicated  preparation  or  time  consuming  pre-test  installations. 

(2)  The  method  provides  information  about  the  condition  of  the  material 
throughout  the  thickness  of  the  component,  not  just  at  a  single  point  on  its 
surface.  This  information  includes  wave  velocities,  attenuation  and  dispersion 
which  are  used  to  calculate  meaningful,  engineering  data  on  the  structural 
integrity  of  the  component  such  as  temperature  variation  and  elongation. 

(3)  The  generation  and  detection  of  ultrasound  is  accomplished  using  fiber 
optic  cables.  These  leads  allow  for  easy  access  into  the  test  cell  and  enables 
the  system  to  interrogate  difficult  to  reach  engine  components  during  testing. 

At  the  present  time,  laser  ultrasonics  has  proven  to  be  an  effective  tool 
for  the  laboratory  monitoring  of  metallic  components.  An  enormous  effort 
is  required  in  order  for  it  to  make  the  transition  from  a  laboratory  technique 
to  the  in-service  monitoring  of  rotating,  composite  engine  components.  This 
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report  sucmsiizcr  e.foits  to  establish  ibe  aiialytica]  fbodai&sMals  fm  the 
^licadmi  of  laser  ultrasonics  to  a  tenqteiature,  oiganic  matrix 
conqxiate  materia!,  Avimid-N.  The  next  step,  rrindi  is  the  saiject  of  the 
author’s  proposed  folIoHHip  mim-gian^  involves  bendunaik  laboratory 
eaqjerimeiits  to  establish  the  Ability  of  the  jiser  generation  and  qnanlitative 
detection  of  ultrasoimd  in  Avimid^.  Samples  of  this  material  have  already 
been  procured  from  the  manufacturer  (see  .^pendix)  for  tilns  proposed, 
etperimental  study. 

LASER  DEIECnON  OF  ULTRASOUND 

It  is  possible  to  combine  the  laser  generation  of  ultrasound  with  piezoelectric 
transducers,  electromagnetic  acoustic  transducers  (EMAT)  or  capacitive  transducers, 
but  a  truly  non'intrusive  system  is  only  established  when  a  remote,  optical  system 
[3,4]  is  used  to  detect  the  ultrasonic  waves;  this  optical  system  is  the  csfy  viable 
option  for  the  non-intrusive  testing  of  aircraft  engine  components.  An  example  of 
a  heterodyne  interferometer  is  shown  in  Figure  2.  This  optical  ^tem  is  operated 
by  splitting  a  single  frequency  laser  light  into  two  components  using  an  acousto-optic 
modulator.  These  two  components,  which  are  separated  in  frequency  by  40MHz,  are 
sent  along  two  anns  of  an  interferometer,  one  of  which  contains  the  sample  being 
monitored.  The  face  of  the  specimen  serves  as  one  mirror  in  the  interferometer. 
The  beams  are  recombined  on  the  surface  of  a  photodetector  producing  a  beat 
frequency  of  40MHz.  Phase  shifts  in  the  h'ght  reflected  from  the  sample’s  surface 
result  in  proportional  phase  shifts  in  the  beat  signal.  As  a  result,  the  40MHz  signal 
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acts  as  a  earner  that  can  be  demoduiaied  to  detenmse  the  time  dependent  velodiy 
ocenning  at  tbe  san^le’s  surface.  The  signal  at  the  photodetector  can  be 
demodulated  in  real  liine  using  an  FM  discriminator.  Tbe  demodulated  output 
signal  ib  intuitional  to  the  normal  surface  velod^  of  the  specimen  and  can  be 
integrated  to  determine  its  time  dependem  displacement 

ELASne  WAVE  PROPAGATlCeJ  CONSIDERATIONS 

Ultrasonics  is  a  very  poweifol  tool  for  the  characterization  of  structural 
materials.  In  order  to  use  ultrasonic  waves  to  effectively  probe  a  sample,  the 
propagation  characteristics  of  the  ultrasonic  waves  themselves  must  be  fully 
understood-  It  is  necessary  to  review  the  basic  aspects  of  wave  propagation  in 
anisotropic  materials.  Spedffcally,  it  is  critical  to  determine  what  can  be  physically 
measured  and  what  can  be  learned  from  tbe  ultrasonic  evaluation  of  a  composite 
specimen.  Here  the  anisotropic  nature  of  the  composite  material  plays  the  central 
role.  An  isotropic  material,  such  as  most  metals,  has  only  two  independent  elastic 
constants  and  does  not  possess  any  directional  preferences;  elastic  wave  propagation 
in  isotropic  materials  is  fairly  well  established  [5].  In  contrast,  the  most  general 
anisotropic  material  has  twenty-one  independent  stiffiiess  (elastic)  constants;  now  a 
strong  directional  preference  exists.  In  recent  years,  there  has  been  a  great  deal  of 
interest  in  wave  propagation  in  anisotropic  material  in  seismology  [6,7],  crystal 
acoustics  [8]  and  electromagnetic  fields  [9].  In  general,  three  different  linear  elastic 
waves  may  propagate  along  any  given  direction  in  an  anisotropic  material  and  each 
of  these  waves  can  travel  with  distinct  phase  velocities.  An  additional  complication 
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is  that  the  energy-velodty  vectors  and  the  phase>velodty  vectors  for  plane  wave 
propagation  are  the  same  for  an  isotropic  material,  while  in  general  they  are 
different  for  an  anisotropic  material  This  means  that  in  an  anisotropic  material  the 
acoustic  energy  does  not  propagate  along  the  wave  front  direction;  this  difference 
will  greatfy  affect  ai^  ultrasonic  measurements  made. 

In  general  a  layered  composite  component  can  be  modeled  as  an  orthotropic 
material;  here  nine  independent  elastic  constants  are  required  to  specify  the  stress- 
strain  relationship.  An  additional  level  of  symmetry,  transverse  isotropy,  exists  for 
a  unidirectional  (all  of  the  reinforcing  fibers  are  aligned  in  a  single  direction) 
composite  specimen  and  reduces  the  number  of  independent  elastic  constants  to 
five.  Ligbtweighl  fiber  reinforced  composites  are  not  homogeneous  materials. 
However,  most  approaches  to  the  mechanics  of  composite  materials  begin  with  the 
simplifying  assumption  that  the  composite  material  may  be  treated  as  homogeneous. 
For  the  ultrasonic  evaluation  of  composites,  if  the  layer  thickness  and  fibers  are 
small  compared  to  the  wavelengths  of  the  ultrasonic  waves,  the  inherent 
inhomogeneity  of  the  fiber  reinforced  composite  can  be  ignored  and  the  material 
treated  as  homogeneous. 

The  time  harmonic  equations  of  motion  for  a  general  anisotropic  media 

« 

reduce  to 


detjr.-y  - =  n  =  =  0  (1) 

Here  the  constants  Ty  are  known  as  the  Christoffel  stiffnesses,  and  can  be  written 
in  terms  of  the  material’s  twenty-one  independent  elastic  stif&iess  components. 
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Equation  (1)  describes  three  velocity  fronts  in  space  for  any  given  direction.  The 
physical  interpretation  of  this  means  that  for  any  direction  of  wave  propagation, 
there  are  three  distinct  phase  velocities  and  their  three  corresponding  displacement 
values  are  orthogonal  As  opposed  to  the  isotropic  case,  these  displacements  are 
neither  truly  longitudinal  nor  truly  transverse.  These  phase  velocities  are  best 
presented  in  terms  of  their  inverse  values,  or  slownesses.  For  example,  examine  the 
clastic  wave  propagation  in  a  transversely  isotropic  material  (the  model  for  a 
unidirectional  composite).  Equation  (1)  reduces  to  calculating  the  determinant  of 
a  three  by  three  system  of  equations.  For  the  simplest  direction  of  propagation, 
perpendicular  to  the  axis  of  symmetiy  (the  fiber  direction),  the  wave  speeds  are 
independent  of  the  direction  of  propagation  in  that  plane  Oiily.  This  is  shown  in  a 
plot  of  their  slowness  curves  (inverse  of  the  phase  velocities),  given  in  Figure  3. 

Instead  of  traveling  with  the  phase  velocity,  the  energy  of  the  ultrasonic  waves 
in  an  anisotropic  media  will  travel  with  the  energy  velocity,  which  is  identical  to  the 
group  velocity  (in  a  lossless  media).  This  difference  must  be  accounted  for  in  the 
design  of  an  effective  ultrasom'c  evaluation  experiment  in  an  anisotropic  medium. 
The  group  velocity  vector,  and  thus  the  energy  velocity  vector,  is  calculated  from  the 
dispersion  relation,  equation  (1),  by 


(2) 


The  group  velocity  components  are  the  values  that  are  physically  measured  and  serve 
as  the  basis  for  the  calculation  of  the  phase  velocity  (or  slowness)  components.  The 
phase  velocities  are  more  useful,  since  the  solution  to  all  problems  of  wave 
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propagation  are  tied  to  their  values. 

In  addition  to  the  bulk  wave  velocity  considerations  just  discussed,  the 
possibility  of  using  guided  waves  [10-14]  for  the  ultrasonic  inspection  of  composite 
materials  must  be  considered.  These  plate  waves  may  provide  useful  information 
for  the  analysis  of  thin  composite  layers.  Here,  structural  vibration  considerations 
dominate  and  values  of  resonance  frequencies  and  dispersion  curves  are  indirectly 
used  for  material  characterization.  The  use  of  these  plate  waves  depends  more  on 
the  component  geometry  (plate  thickness  and  boundary  conditions)  and  occur  on  a 
slower  time  scale  than  the  bulk  waves.  Dispersion  curves  (frequency  versus  phase 
velocity)  are  available  [12,13]  and  can  be  incorporated  into  any  testing  scheme. 


C^ULATION  OF  ELASTIC  CONSTANTS 

The  required  starting  point  for  the  application  of  laser  ultrasonics  to  the 
characterization  of  high  temperature,  composite  aircraft  engine  materials  is  the 
experimental  validation  of  the  technique  for  the  material  being  considered,  Avimid- 
N.  The  answer  to  the  question  of  whether  laser  generation  and  detection  is  possible 
with  this  material  is  best  determined  in  a  laboratory  environment.  In  the  laboratory, 
the  first  step  in  the  ultrasonic  inspection  of  an  anisotropic  material  is  to  accurately 
acquire  its  material  properties,  specifically,  the  matrix  of  its  elastic  constants  which 
describe  the  homogeneous  medium  of  a  perfectly  manufactured  material  [15-17]. 
The  case  of  a  unidirectionally  reinforced  composite,  modeled  as  a  transversely 
isotropic  material,  with  z  as  the  axis  of  symmetry  (as  shown  in  Figure  4),  comprises 
determining  the  five  independent  elastic  constants 
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(3) 


This  is  the  proposed  symmetry  model  for  unidirectional,  carbon  reinforced,  Avimid- 
N.  It  is  important  to  note  that  these  elastic  constants  directly  depend  upon  the 
phase  velocities,  as  given  in  equation  (1). 

Besides  being  a  group  of  material  constants,  these  stiffness  components 
(equation  (3))  contain  information  about  the  reference  state  of  the  material  in  a 
particular  neighborhood  and  can  be  used  to  determine  changes  from  this  reference 
state.  Once  the  reference  state  of  the  material  is  completely  characterized,  the 
ultrasonic  waves  can  be  used  to  detennine  defects  such  as  voids,  delaminations  and 
porosity  as  well  as  potentially  detect  some  more  subtle  changes  like  temperature 
variation,  elongation  and  state  of  stress,  ITiese  differences,  which  will  appear  as 
changes  in  the  measured  ultrasonic  waveform’s  shape,  phase  velocity,  attenuation  and 
dispersion  will  be  discussed  in  detail  in  the  next  section.  A  more  empirical  approach 
to  material  characterization  using  ultrasonic  waves,  the  stress  wave  factor,  is 
discussed  in  part  II  of  these  report. 
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Hie  anisotropy  of  the  material  causes  enormous  complications  in  any 
ultrasonic  investigation.  The  fundamental  aspects  of  wave  propagation  are  effected, 
including  energy  velocity,  beam  splitting,  beam  skewing,  untymmetrical  field  profiles, 
imusual  side  lobes  and  beam  divergence.  These  can  all  cause  errors  in  accurate 
material  characterization.  This  heightens  the  importance  of  understanding 
anisotropic  wave  propagation,  as  discussed  in  the  previous  section. 

The  experimentally  obtained  phase  velocities  are  verified  using  the  previously 
determined  numerical  results,  which  account  for  the  difference  between  energy  and 
phase  velocity  directions.  This  involves  the  solution  of  a  system  of  nonlinear 
algebraic  equations  for  the  unknown  phase  velocities.  These  results  are  used  to  fill 
in  the  elastic  constant  matrix  of  equation  (3).  A  tentative  procedure  for  the 
experimental  determination  of  the  elastic  constants  of  a  transversely  isotropic 
(unidirectional)  composite  material  is: 

(1)  Measure  the  longitudinal  phase  velocity  along  the  fiber  direction  to 
evaluate  constant  C33. 

(2)  Measure  the  longitudinal  phase  velocity  perpendicular  to  the  fiber 
direction  to  evaluate  constants  Cj,  and  C22. 

(3)  Measure  the  vertically  polarized  shear  velocity  perpendicular  to  the  fibers 
to  evaluate  constants  C44  and  C55. 

(4)  Measure  the  horizontally  polarized  shear  velocity  along  the  fiber  direction 
to  evaluate  (calculate  Cj2“Cjj”2Cj^). 

(5)  Measure  the  quasi-longitudinal  phase  velocities  at  various  angles  to  the 
fiber  direction  to  evaluate  and  C23. 
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So,  by  taking  advantage  of  the  symmetry  of  the  transversely  isotropic  composite  and 
varying  the  incident  anglt',  redundancies  can  be  utilized  to  check  the  validity  of  the 
measurements. 

MATERIAL  CHARACTERIZATION  USING  ULTRASONICS 

The  overall  stiffness  matrix  of  equation  (3)  provides  a  frame  of  reference  to 
begin  quantitative  material  characterization.  This  is  the  most  critical  application  for 
aircraft  engine  testing  applicarions.  Since  laser  ultrasonics  provides  a  very  broad 
band  signal,  it  is  particularly  well  suited  for  detecting  localized  changes  in  elastic 
properties.  Traditional  ultrasonics  has  been  used  to  detect  changes  in  temperature 
[18],  microstructure  [19]  and  to  measure  residual  stresses  [20].  The  technique  has 
proven  to  be  quite  effective  in  characterizing  and  monitoring  a  material’s 
microstructure,  even  during  fabrication  processes.  Because  the  ultrasonic  waves 
propagate  through  the  thickness  of  the  specimen,  laser  ultrasonics  provides 
information  about  the  condition  of  the  entire  component. 

The  relationship  between  applied  stress  and  the  resultant  strain  in  a  solid 
material  is  usually  considered  to  be  linear.  But  even  within  the  elastic  range,  small 
departures  from  linearity  exist;  this  means  that  the  elastic  constants  vary  slightly  with 
stress.  The  variation  of  the  constants’  values  can  be  very  small,  on  the  order  of 
fractions  of  a  percent.  Acoustoelasticity  [20]  uses  elastic  (ultrasonic)  waves  to 
quantify  these  small  changes  in  stress.  As  shown  in  the  previous  section,  the  phase 
velocity  of  elastic  waves  in  a  medium  are  dependent  on  the  elastic  constants,  so  it 
is  possible  to  determine  the  state  of  stress  using  accurate  measurements  of  changes 
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in  wave  velocities.  Acoustoelasticity  is  based  on  a  continuum  theoiy  of  small 
disturbances  (the  ultrasonic  waves)  superimposed  on  an  elastically  deformed  body 
(the  engine  component).  The  advantage  of  laser  ultrasonics  is  the  accuracy  of  its 
measurements  and  its  repeatability,  which  allows  for  the  use  of  time  averaging 
schemes.  This  is  an  area  that  demands  further  research,  after  the  experimental 
measurement  of  Avimid-N’s  "reference"  elastic  constants. 

The  temperature  variation  through  the  thickness  is  not  based  on  a  direct 
measurement;  instead  temperature  dependent  physical  properties  are  used.  Again, 
accurate  measurement  of  the  elastic  constants  is  necessary.  The  laser  ultrasonic 
method  is  capable  of  making  the  required  time-of-flight  (phase  velocity) 
measurements.  Extreme  caution  is  required  since,  like  the  acoustoelastic  effect,  the 
expected  changes  are  very  small  and  are  dependent  on  other  factors  such  as 
microinhomogeneity  and  texture.  For  these  reasons,  the  use  of  the  ultrasonic 
technique  to  measure  initial  stresses  and  temperature  variation  is  still  being 
developed.  This  does  not  preclude  the  application  of  laser  ultrasonics  for  the  non- 
intrusive  evaluation  of  composite  engine  components;  the  technique  is  unmatched 
in  its  potential  for  this  purpose. 

CONCLUSION 

Laser  ultrasonics  is  a  powerful  tool  for  the  characterization  of  structural 
materials.  The  method  can  provide  relevant  engineering  data  about  the  structural 
integrity  of  the  entire  component  without  interfering  with  the  process  being 
monitored.  Its  potential  to  investigate  in-service,  high  temperature  composite 
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materials  is  unmatched.  To  be  effective,  the  propagation  characteristics  of  elastic 
waves  in  an  anisotropic  media  must  be  examined.  The  waveform,  phase  velocity, 
attenuation  and  dispersion  of  these  ultrasonic  waves  contain  usehil  information  about 
the  structural  integrity  of  an  engineering  component  The  required  first  step  for  the 
application  of  laser  ultrasonics  to  the  characterization  of  high  temperature,  organic 
composite  materials  is  the  laboratory  determination  of  the  candidate  material’s, 
Avimid-N,  elastic  constants  (as  given  in  equation  (3)).  This  will  not  only  validate  the 
laser  ultrasonic  method  for  Avimid-N,  but  also  provide  information  about  the 
reference  state  of  this  material  in  a  particular  neighborhood.  Once  this  reference 
state  is  completely  characterized,  elastic  waves  can  be  used  to  detect  subtle  changes 
from  this  reference  state  such  as  elongation  and  temperature  variation.  It  is  these 
values  that  provide  useful  engineering  information  to  test  and  analysis  engineers. 
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Figure  2.  Inter!  srometer 
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Figure  3.  Typical  Slowness  Curve 
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Figure  4.  Unidirectional  Composite 
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APPENDIX 


Key  Phone  Congeisatfons 
L  Dr,  Didi  Gomelia,  Dupont  (3Q2)  999-2498 

He  is  providing  us  with  sample  Avimid-N  temperature,  polyimide)  specimens. 
This  material  should  be  used  by  Pratt  and  ¥/hitn^,  reinforced  with  carbon  fibers. 

2.  Dr.  Tom  Harding,  Dupont 

Discussed  Dupont’s  NDE  procedures  for  the  Avunid-N.  Oirrently  using  ultrasonic 
immersion  techniques  for  defect  detection. 

3.  Mr.  Dick  Froom,  McQelland  Air  Fon^  Base,  California 
(916)  643-4274  (commercial),  633-4274  (autovon) 

He  is  developing  a  laser  ultrasonic  scanning  system  for  the  Air  Force.  Two  systems 
are  available,  one  that  is  capable  of  interrogating  an  entire  aircraft  and  a  second  that 
looks  at  individual  components. 

4.  Henry  Jones,  AEDC  (454-7750) 

Discussed  accessibility  issue  of  the  fiber  optic  probes  in  an  engine.  Manufacturer 
will  provide  access  holes  (at  locations  of  their  choosing)  through  the  engine  case. 
Provided  helpful  information  based  on  his  experience  with  laser  monitoring  of  blade 
tip  deflections. 

5.  Carl  Brasier,  AEDC  (454-4661) 

Discussed  current  laser/optical  capabilities  available  at  Arnold.  There  exists,  in 
house,  sufficient  hardware,  including  lasers,  optical  components  and  data  acquisition 
systems  for  any  future  follow-on  demonstration  at  AEDC. 
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IMPLEMENTATION  OF  MULTIGRID  IN  THE  PARC  CODE 


Steven  M.  McKay 

AbstracL  The  PARC  code  is  a  general  purpose  Navier  Stokes  solver  developed  at  Arnold 
Engineering  Development  Center  at  Arnold  Air  Force  Base,  Teimessee.  It  is  used 
continuously  in  design  and  analysis  of  flow  characteristics  of  aircraft.  However,  these  types 
of  flows  are  very  complicated  and  difficult  to  solve.  Time  dependent  multigrid  has  been 
implemented  in  the  PARC  code  which  accelerates  convergence  to  steady  state.  Convergence 
of  the  multigrid  solver  is  on  the  order  of  convergence  of  the  coarsest  grid. 

1.  Introduction. 

In  the  past  decade  computational  speed  and  memory  capacity  has  increased  greatly. 
This  has  allowed  for  the  solution  of  many  large  scale  problems  which  previously  were 
considered  intractable.  However,  there  are  still  a  large  class  of  problems  for  which  solution 
is  obtained  only  after  great  expense  in  terms  of  both  computer  time  and  man  hours.  Hence, 
there  is  a  critical  need  to  apply  a  solution  technique  which  is  efficient  in  terms  of  memory 
and  provides  a  fast  solution. 

One  such  problem  is  that  of  the  development  of  an  optimal  free-jet  forebody  design. 
Due  to  the  nature  of  the  problem,  optimization  of  a  free-jet  forebody  design  simulator  can 
only  be  accomplished  by  use  of  direct  function  optimization.  In  this  case,  the  optimization 
is  not  cast  as  part  of  the  fluid  flow  analysis.  Instead  measurements  are  made  from  a  CFD 
solution  of  the  fluid  flow  to  determine  the  change  in  design.  This  type  of  optimization  is 
naturally  very  expensive  as  it  requires  many  calculations  of  the  fluid  flow  to  obtain  a 
solution.  Therefore  an  improvement  in  the  speed  of  such  solutions  would  greatly  benefit 


23-1 


the  above  project 

Multigrid  is  a  technique  used  to  accelerate  convergence  of  solutions  of  partial 
differential  equations.  Traditionally,  multigrid  has  been  used  in  the  acceleration  of  the 
convergence  of  iterative  matrix  solvers.  Recently  good  success  has  been  achieved  by 
applying  multigrid  like  algorithms  to  the  Navier  Stokes  equations  and  accelerating 
convergence  to  steady  state.  Multigrid  can  provide  dramatic  acceleration  of  the  convergence 
process  thus  yielding  a  more  efficient  solution. 

Another  advantage  of  multigrid  is  the  ability  to  use  solvers  already  existing  in  code 
to  find  a  solution.  Multigrid  can  be  thought  of  as  an  outer  algorithm;  the  solver  need  not 
be  specified.  Though  some  compensation  must  be  made  for  different  solvers,  this  aspect  of 
multigrid  lends  itself  to  robust  code. 

In  this  paper  we  describe  the  implementation  of  multigrid  in  the  PARC  code.  We 
will  show  that  there  can  be  a  tremendous  savings  in  cost  by  use  of  the  described  algorithm. 
In  section  2  we  give  a  brief  introduction  to  the  Navier  -  Stokes  equations.  We  describe  the 
PARC  code  in  section  3,  and  the  multigrid  algorithm  in  section  4.  In  section  5  we  give  a 
sample  problem,  and  in  section  6  we  give  our  conclusions. 

2.  The  Navier  -  Stokes  Equations. 

By  applying  certain  assumptions  to  fluid  flow  through  a  body,  the  Navier  -  Stokes 
equations  can  be  derived.  (For  greater  detail,  we  refer  the  reader  to  [1]).  For  two 
dimensional  flow,  they  can  be  written  in  vector  form  as 


where 
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is  the  viscous  stress  tensor  and 


_ K  dT 

p,Pr  dXj 


is  the  heat  flux  vector. 

Note  that  there  are  four  equations  and  six  variables  of  interest.  In  order  to  simplify  the 


above  equations,  certain  assumptions  are  made. 
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First,  Stokes  hypothesis, 


is  used  in  .  Next,  the  PARC 

code  makes  the  assumption  that  the  fluid  is  thermally  and  calorically  perfect.  Pressure  and 
temperature  are  then  calculated  by  the  following  equations: 

P=(Y-1)  (K-l/2p(Ui^+u|)) 
r=Y(Y-l)  (S/p~l/2(ui^+u|)) 

Y  is  the  ratio  of  specific  heats.  The  above  equations  relate  the  pressure  and  temperature 

to  the  conser/aticn  variables  and  reduces  (1)  to  a  system  of  four  equations  with  four 
unknowns.  The  difficulty  in  the  Navier  -  Stokes  equations  is  in  the  nonlinear  relationships 
of  the  consemtion  variables.  This  leads  to  systems  which  are  difficult  to  solve. 

3.  The  PARC  code. 

The  PARC  code  is  a  general  Navier  -  Stokes  solver,  Tne  current  implementation  has 
tlrree  main  areas  of  interest,  namely,  implementation  of  boundary  conditions,  types  of 
solvers,  and  grid  blocking. 

Boundary  conditions  are  currently  implemented  in  the  PARC  code  to  aUov,^  for 
internal  or  external  flows,  solid  boundajies  of  different  types,  and  symmetry  boundaries. 
These  have  been  implemented  for  subsonic  and  supersonic  flow.  All  boundary  conditions 
currently  implemented  are  done  via  direct  solution  algorithms.  That  is,  the  consemtion 
variables  at  the  boundary  are  solved  using  only  information  from  the  previous  solution.  The 


23-4 


|T7~ITT'ainn  ~ii  IfijUII  t  li  |['  j 


large  number  of  boundary  conditions  available  lead  to  a  versatile  code  which  can  solve  a 
large  variety  of  problems. 

There  are  two  solvers  currently  implemented  in  the  PARC  code.  These  are  an 
implementation  of  the  Beam-Warming  algorithm,  and  pseudo  Runge-Kutta.  The  Beam- 
Warming  algorithm  implemented  is  Euler  backward  differencing.  The  system  of  equations 
can  be  written  as 


riJ^l 


dxi  dx2 


1 

Re 


(  dGi+1 


dxj  dx. 


2  / 


=0 


where  A0=Q’^*^-0"  and  .  Fj*^  is  approximated  by  a  first  order 


taylor  polynomial  Fj*^"Fj+Afil^Q  where 


The  viscous  flux  vectors  are 


then  lagged,  i.e.,  .  This  leads  to 
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The  derivatives  in  (2)  are  approximated  by  central  difference  operators.  This  leads  to  a 
complex  system  of  equations.  In  order  to  alleviate  the  difficulty  presented  by  such  a  matrix, 
the  left  hand  side  is  approximately  factored  and  an  ADI  algorithm  is  used  to  find  a  solution. 
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Because  central  differencing  is  used,  artificial  viscosity  is  required  for  stability.  We  refer  the 
reader  to  [2]  for  a  discussion  of  the  artificial  viscosity  used.  For  a  complete  discussion  on 
the  Beam  -  Wanning  algorithm,  see  [1],  pp.  489-494. 

As  can  be  seen  from  the  above.  Beam  Warming  is  an  implicit  solver;  the  equations 
of  the  conservation  variables  involve  values  at  neighboring  points.  The  pseudo  Rimge-Kutta 
algorithm,  however,  is  an  explicit  solver;  the  solution  of  the  conservation  variables  depends 
only  on  the  solution  at  previous  time  steps.  This  provides  a  less  costly  solution.  However, 
the  allowable  time  step  is  limited  by  the  CFL  condition  to  maintain  stability. 

Pseudo  Runge-Kutta  is  in  fact  not  related  to  the  Runge-Kutta  method  for  solution 
of  ordinaiy  differential  equations.  It  is  in  fact  a  multistep  time  integration  scheme.  In  the 
PARC  code  three,  four  or  five  stage  schemes  are  implemented. 

The  five  stage  scheme  is  as  follows: 

C>(0)=C>” 

Q{l)=Q{0)*^^tR(Oi0)) 

4 

D 

QO)=0{2)*^^tRiQ{2) ) 
o 

C»(4)=C’(3)+AAti?(p(3) ) 
p«*i=C)(4)+Ati?(C>(4)) 

where  R{Q[I))  is  the  right  hand  side  calculation.  It  should  be  noted  that  the  Runge- 

Kutta  algorithm  has  difficulty  in  converging  to  steady  state  if  implemented  as  above.  To 
accelerate  convergence,  implicit  lesidual  smoothing  is  applied  (see  [2]). 

The  ciurent  implementation  of  the  PARC  code  also  allows  the  user  to  separate  a 
large  problem  into  blocks,  each  of  which  can  be  solved  separately.  This  allows  for  solution 
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of  very  large  problems  in  systems  with  limited  memory.  After  one  or  more  time  steps  in  a 
block,  the  boxmdary  information  is  stored  so  that  contiguous  blocks  may  use  it. 

It  should  also  be  mentioned  that  the  PARC  code  implements  metrics  to  enable 
solutions  to  be  calculated  on  general  grids.  As  this  has  only  minor  impact  on  the  algorithm 
we  present  here,  we  will  not  discuss  it  in  this  paper. 

4.  Multigrid. 

There  are  basically  two  types  of  multigrid  methods  which  are  implemented  today. 
The  first  is  used  to  accelerate  the  convergence  of  an  implicit  matrix  solver  operating  on  a 
system  of  difference  equations  used  to  approximate  solutions  to  partial  differential  equations 
or  systems  of  equations.  The  second  is  used  on  a  time  dependent  problem  to  accelerate 
convergence  to  steady  state.  The  PARC  code  can  benefit  from  both  types  of  algorithms. 
The  quickest  improvement,  however,  can  be  realized  by  implementing  the  second  type  of 
multigrid.  This  is  due  to  the  fact  that  there  are  no  implicit  matrix  solver  currently 
implemented  in  the  PARC  code. 

In  order  to  discuss  the  multigtid  algorithm,  we  first  need  to  establish  some  notation. 
Let  be  the  grid  on  which  a  solution  is  desired.  Let  be  the  grid  obtained  by 

removing  every  other  grid  line  from  in  each  coordinate  direction.  There  is  an  n, 

,  such  that  Af^  cannot  be  coareened  in  the  manner  stated  above.  Also,  let 

denote  the  calculated  solution  (by  the  solver  of  choice)  on  grid  i  at  time  step  n. 

Next,  we  define  two  operators  which  provide  communication  between  solutions  on 
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different  grids,  namely  a  prolongation  operator  I :  Af  and  a  restriction  operator 


I  ^  .  For  this  discussion,  the  prolongation  operators  and  restriction  operators 

can  be  viewed  as  interpolation  and  injection,  respectively. 

From  the  approximately  factored  algorithm  generated  from  (2),  we  see  that  the 

system  of  equations  to  be  solved  on  Af  can  be  written  as 

Li(Qr)^Oi*^=RiQi) 

where  L  is  the  matrix  constructed  from  the  difference  scheme. 

The  concept  behind  multigtid  is  to  allow  convergence  to  be  computed  in  part  by  the 
coarse  grid  equations.  As  mesh  size  of  the  grid  increases,  the  time  step  restriction  given  by 
the  CFL  condition  can  be  relaxed.  Thus,  the  coarse  grid  equations  can  march  to  the  steady 
state  condition  faster  than  the  fine  grid  equations.  In  order  to  retain  the  truncation  error 
on  the  fine  grid,  the  coarse  grid  solutions  must  be  transmitted  to  the  fine  grid  at  regular 
intervals. 

The  algorithm  currently  implemented  in  the  PARC  code  is  due  to  Jameson  [6].  The 
following  is  a  description  of  the  algorithm  in  pseudo  code  with  the  Beam  Warming  method 
as  solver. 

For  level  =  1  to  number  of  levels  do; 

Solve  LiQil^el)  £^Olelel=RiOlevel)  fOT 


Update  Qilvel  *'Q  level + ^  0 level 


If  level  <  number  of  levels  then  do: 


^level+l*~^  ‘^^level  ^Qlevel^ 


Qlevel*l=I  ^Qlevel 


For  level  =  number  of  levels  - 1  to  2  do: 

^Qlevel'~^Ql6vel'^^^0 level*! 


Hence,  the  coarse  grids  are  given  the  current  right  hand  side  from  the  fine  grids,  and  the 
old  solution  on  the  coarse  grid  is  the  nevyr  solution  on  the  fine  grid.  After  solutions  are 
calculated  on  all  grids,  the  differences  are  transferred  to  the  fine  grid. 

The  same  general  algorithm  can  also  apply  to  the  pseudo  Runge-Kutta  algorithm, 
but  some  modifications  must  be  made.  In  the  Runge-Kutta  algorithm,  R  is  calculated  five 
times.  Each  time  R  must  be  modified  by 

R{Q{I) )  -R(Q{I) )  +  il'^RiQievei-i)  -R(QiO) ) ) 
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In  other  words,  R{Q{0))  is  merely  a  restriction  of  the  fine  grid  right  hand  side.  Fmther 


right  hand  side  calculations  are  accotmted  for  by  the  formula  above. 

5.  Sample  problem. 

The  problem  we  consider  is  flow  through  a  two  dimensional  diverging  nozzle  with 
straight  duct  segments  at  each  end.  The  initial  conditions  given  the  PARC  code  were 
appropriate  for  Mach  .29  free  stream  flow.  Reference  pressure  was  at  ISpsia  while  the 
reference  temperature  was  600  degrees  Rankine.  The  ratio  of  specific  heats  was  1.4.  The 
inflow  pressure  and  temperature  agreed  with  the  reference  pressure  and  temperature,  while 
the  outflow  boundary  condition  relied  on  a  static  pressure  of  14.13psia.  The  domain  and 
grid  used  to  obtain  a  solution  are  given  in  Figures  1  and  2. 

As  the  grid  given  in  Figure  2  is  33  by  11,  it  can  be  coarsened  only  once  and  still 
retain  agreement  between  coarse  and  fine  grid  points.  For  this  reason,  comparisons 
between  the  multigrid  PARC  code  and  the  original  PARC  code  were  made  on  three  grids. 
On  the  33  by  11  grid.  Multigrid  operated  with  two  levels.  On  the  65  by  21  grid  it  had  3 
levels,  while  on  the  129  by  41  grid  there  were  4  levels.  The  following  table  gives  a 
comparison  of  the  two  methods  on  each  grid.  All  solutions  were  due  to  calculations  by  the 
Beam-Warming  solver  as  implemented  in  the  PARC  code.  The  number  of  iterations  given 
in  the  table  is  the  iteration  count  reqtured  for  the  L2  norm  of  the  residual  to  be  driven 

below  10"^^  . 
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As  can  be  seen  above,  the  speed  of  convergence  (number  of  iterations  required)  is 
approximately  the  same  for  each  grid  even  though  the  grid  sizes  vary  by  a  factor  of  two  in 
each  direction  every  time  the  grid  is  changed.  The  convergence  seen  is  due  to  the 
contribution  of  the  coarser  grids.  Since  each  problem  given  above  has  the  same  size  grid 
for  the  coarsest  mesh,  the  convergence  remains  roughly  the  same. 

6.  Conclusions. 

The  problem  that  we  have  attempted  to  address  during  the  ten  weeks  of  the  1991 
Summer  Faculty  Fellowship  prograr  was  implementation  of  multigrid  and  adaptive  gridding 
techniques  in  the  PARC  code.  This  is,  of  course,  an  ambitious  project  which  cannot  be 
concluded  in  a  ten  week  time  frame.  Therefore,  the  implementation  of  Multigrid  in  the 
PARC  code  was  begun  first.  We  have  shown  that  there  can  be  a  significant  improvement 
in  the  convergence  of  a  fluid  flow  problem  if  Multigrid  is  applied  with  a  large  number  of 
levels.  Due  to  the  extra  overhead  required  for  the  Multigrid  solver,  a  significant  savings 
might  not  be  realized  with  a  small  number  of  levels.  This  implies  that  extra  care  should  be 
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made  in  choosing  a  grid  which  can  be  coarsened  a  large  nximber  of  times.  In  many 
instances,  a  slight  change  in  the  mesh  size  in  either  direction  would  provide  an  appropriate 
grid. 

Though  the  results  given  in  this  paper  are  encouraging,  this  certainly  can  not  be 
viewed  as  the  final  implementation.  There  are  many  problems  which  still  need  to  be 
addressed.  Though  the  multigrid  algorithm  implemented  in  the  PARC  code  at  present  only 
works  for  grids  which  have  one  block,  it  can  be  easily  changed  to  work  with  multi-block 
problems.  This  can  be  accomplished  by  performing  one  Multigrid  cycle  on  each  block 
before  moving  to  the  next.  A  more  efficient  algorithm  would  cycle  through  each  block  on 
a  level  before  going  to  the  next  level.  An  implementation  of  the  second  type,  however, 
would  require  a  different  block  interpolation  routine  than  is  currently  available  in  the  PARC 
code. 

Self  adaptive  grids  can  be  implemented  an  number  of  ways.  In  a  finite  difference 
code  the  easiest  way  to  refine  the  grid  would  be  to  create  a  new  block  with  the  needed 
refinement.  In  order  to  solve  on  the  new  composite  mesh,  the  boundary  interpolation 
routines  for  the  new  block  need  to  be  changed  to  reflect  conservation  principles  across  the 
block  boundary.  Currently  the  PARC  code  does  not  implement  this  with  sufficient  accuracy. 

Also,  the  PARC  code  is  limited  in  the  choice  of  available  solvers.  This  should 
certainly  be  augmented  to  include  a  flux  splitting  solver  with  upwind  differencing.  This 
type  of  solver  appears  to  do  well  on  approximating  supersonic  flow.  Other  solvers  could 
also  be  implemented  if  desired. 

Finally,  as  was  mentioned  earlier,  the  boundary  conditions  of  the  PARC  code  are  all 
explicit.  Implicit  boundary  conditions  could  improve  the  performance  of  the  Beam-Warming 
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salver  or  odieriiin>lidtsoh'cis.  Tins  cctild  foidier  iooease  the  efiiectiveiiess  of  the  PiyiC 
code. 

The  prc^KKed  dianges  to  the  PARC  code  menlioiied  above  are  intennediate  steps  to 
diegoaliaeiitioiiedatdiebegiimingoftiiissectioi].  Inq>kmeiiiairan  of  adaptive  refinement 
methods  win  also  increase  the  effectnreaess  of  die  PARC  code  and  lead  to  a  production  code 
which  is  able  to  find  accurate  solutions  in  less  thup- 
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Figure  1.  The  domain  of  the  flow  problem  presented  in  section  5. 


Figure  2.  The  33  by  11  grid  used  for  the  solution  of  the  problem  given  in  section  5. 
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ABSTRACT 


The  Photoactivation  Spectrometer,  Differential  Absorption  Sepctrometer  and 
Compton  Spectrometer  are  examined  as  possible  detection  systems  for  the  measurement 
of  the  spectra  of  pulsed  bremsstrahlung.  Detectors  considered  include  the  Ionization 
Chamber,  PIN  Junction  Diode,  Thermoluminescent  Dosimeter  (TLD)  and  Scintillator. 
The  Photoactivation  Spectrometer  samples  the  spectrum  at  a  relatively  small  number  of 
points,  and  is  limited  by  a  lack  of  complete  and  reliable  data  on  the  nuclear  parameters 
involved.  Since  the  absorption  characteristics  of  available  absorbers  do  not  vary 
significantly  at  energies  above  about  0.8  Mev,  Differential  Absorption  Spectrometers 
show  poor  energy  resolution  in  that  region.  They  do,  however,  provide  good  results  in 
the  low  energy  end  of  the  spectrum.  The  Time  Projection  Compton  Spectrometer 
(TPCS)  shows  the  greatest  promise  as  a  detection  system  for  the  measurement  of  pulsed 
X-ray  spectra  in  the  whole  range  of  interest,  0.1  Mev  to  2.0  Mev,  although  its 
deployment  may  be  costly  in  terms  of  time,  money  and  manpower. 

1.  INTRODUCTION 

A  proposal  is  under  consideration  to  build  a  facility  at  AEDC  for  the  purpose 
of  testing  space-bound  military  equipment  in  a  nuc  .ar-weapcns  environment,  with  a 
view  to  ensuring  that  it  would  survive  a  nuclear  attack.  The  major  threat  comes  from 
the  release  of  X-radiation,  which  can  result  in  a  degradation  of  performance  or  even 
permanent  damage.  Simulation  of  nuclear-weapons  radiation  would  be  achieved  by 
exposing  the  equipment  to  intense  flash  X-ray  sources.  These  are  generated  by 
injecting  high-energy  pulsed  electron  beams  into  standard  converter  configurations, 
where  they  produce  continuous  X-rays  (bremsstrahlung)  together  with  characteristic 
line  radiation.  In  order  to  implement  this  testing  program,  the  energy  spectrum  of  the 
radiation  must  be  accurately  known.  The  spectra  emitted  by  pulsed  X-ray  simulators 
can  be  computed  from  the  time  histories  of  the  diode  current  and  voltage,  based  on  the 
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predictions  of  a  coupled  electron-photon  Monte  Carlo  code,  such  as  the  Integrated 
TIGER  Series  (ITS)*  •  However,  the  electron  beam  parameters  are  not  well  known, 
the  photon  spectrum  depends  on  the  angle  of  incidence  of  the  electrons  on  the 
bremsstrahlung  converter,  and  the  simulators  suffer  from  a  lack  of  shot-to-shot 
reproducibility.  It  is  therefore  desirable  to  devise  a  technique  for  measuring  the  photon 
spectrum  directly,  with  good  precision,  and  on  a  shot-to-shot  basis.  The  range  of 
interest  is  from  0.1  Mev  to  2.0  Mev. 

For  the  purposes  of  this  Report,  an  X-ray  Spectrometer  is  defined  as  a  device 
which  is  used  to  measure  the  spectrum  of  pulsed  bremsstrahlung.  The  spectrum  may 
be  described  either  in  terms  of  the  spectral  photon  distribution  U(e)  or  the  spectral 
fluence  F(e),  where  U(e)  de  is  the  number  of  incident  photons  per  unit  area  whose 
energy  lies  in  the  range  e  to  e  +  de,  and  F(e)  de  is  the  energy  per  unit  area  of  those 
photons.  Clearly  F(e)  =  e  U(e).  An  important  characteristic  of  a  radiation  detector  is 
its  Response  Function  R(e),  which  relates  the  input  F(e)de  to  the  detector  to  the 
measured  output  dM  (charge  collected,  for  example)  as  follows: 


dM  =/?0)F^)ie 

(1.1) 

(1.2) 

A  number  of  detection  systems  has  been  described  recently  in  the  literature^*, 
using  a  variety  of  radiation  detectors.  Section  II  contains  a  discussion  of  the  structure 
and  use  of  these  detectors,  and  in  Section  III  we  present  an  analysis  of  the  detection 
systems  themselves,  including  the  techniques  used  to  unfold  the  spectrum.  In  Section 
IV  we  offer  some  tentative  conclusions  about  the  relative  merits  of  the  systems 
considered. 


n.  RADIATION  DETECIORS 

Broadly  speaking,  a  radiation  detector  is  simply  a  device  which  interacts  with 
the  incident  radiation  to  produce  some  observable  effect.  It  consists  essentially  of  the 
following  components: 

1.  A  sensitive  volume,  containing  the  material  which  interacts  with  the  radiation. 

2.  A  supporting  structure,  v/hich  encloses  the  sensitive  volume  and  provides  the 
necessary  instrumentation,  if  any.  An  essential  part  of  this  structure  is  the  entrance 
window,  through  which  the  radiation  is  introduced  into  the  sensitive  volume. 
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3.  An  output  mechanism,  which  extracts  the  output  signal  and  presents  it  to  the  signal 
processing  system. 

A  number  of  physical  processes  have  been  used  as  the  basis  for  constructing 
practical  radiation  detectors,  e.g.  Nuclear  Isomerism,  Ionization,  Thermoluminescence, 
etc.,  and  each  of  these  will  be  discussed  in  turn. 

Photoactivation  Detectors 

Excited  nuclear  states  can  be  produced  by  a  number  of  mechanisms,  including 
electromagnetic  excitation.  The  decay  rate  depends  strongly  on  the  angular  momentum 
I  of  the  excited  state,  and  if  there  is  a  large  difference  in  I  between  the  excited  state  and 
all  the  lower  states,  transitions  to  these  states  are  highly  forbidden  and  the  lifetime  of 
the  excited  state  turns  out  to  be  relatively  long.  These  long-lived  excited  nuclear  states 
are  called  Isomers.  They  have  lifetimes  ranging  from  a  few  seconds  to  several  hours. 
Photoactivation  of  nuclear  isomers  proceeds  via  (/,/')  reactions,  in  which  a  nucleus  is 
excited  from  its  ground  state  to  a  gateway,  or  activation,  state  by  absorption  of  a 
photon  y,  and  subsequently  decays  to  the  metastable  state  (isomer)  with  emission  of  a 
photon  y'.  The  isomer  then  decays  by  process  of  delayed  fluorescence,  emitting  a 
signature  photon  which  may  be  used  to  identify  the  isomer.  The  isomer  yield 
represents  the  output  of  the  device,  and  is  used  to  obtain  information  about  the  intensity 
of  the  incident  radiation.  The  use  of  this  technique  has  been  discussed  extensively  in  an 
earlier  Report® ,  and  will  not  be  considered  further  here. 

Ionization  Chambers 

The  ionizing  effect  of  electromagnetic  radiation  is  utilized  in  a  number 
of  devices  to  measure  such  quantities  as  the  intensity  of  the  radiation,  the  absorbed 
dose,  etc.  Extensive  accounts  of  the  design  and  operation  of  ionization  chambers  have 
been  given  by  W.  H.  Tait®  and  by  J.  W.  Boag‘°.  The  basic  structure  of  the  instrument 
is  shown  in  Fig.  1.  The  material  inside  the  sensitive  volume  is  usually  a  gas  and  hence 
they  are  known  generically  as  gas  counters.  They  typically  consist  of  two  electrodes 
placed  inside  a  chamber  containing  the  gas.  The  electrodes  may  be  co-axial  cylinders 
or  concentric  spheres,  but  are  more  often  a  pair  of  parallel  plates,  separated  by 
insulating  spacers,  and  in  what  follows  we  shall  assume  that  this  is  the  case.  The 
advantage  of  the  parallel  plate  geometry  is  that  a  uniform  electric  field  is  maintained 
between  the  electrodes.  However,  this  uniformity  tends  to  break  down  near  the  edges 
of  the  plates,  where  the  field  lines  curve  outwards  and  the  field  strength  gradually  falls 
off  to  zero.  To  avoid  this  type  of  distortion,  which  makes  it  difficult  to  interpret  the 
measured  output  of  the  detector,  guard  ring  plates  are  placed  around,  and  co-planar 
with,  the  electrodes.  They  are  kept  at  the  same  potentials  as  the  electrodes,  but  are  not 


24-3 


connected  to  the  signal  processing  system.  The  guard  rings  not  only  remove  the  non¬ 
uniformity  of  the  field  near  the  edges  of  the  electrodes,  they  also  help  to  define  the 
sensitive  volume  more  precisely. 

The  chamber  is  made  of  a  solid  which  is  rigid,  impervious  to  the  gas  and  free 
from  chemical  attack  by  it,  and  does  not  itself  contaminate  the  gas  by  outgassing.  Both 
the  gas  and  the  solid  must  be  stable  in  the  radiation  environment.  Ion  pairs  are  created 
as  a  result  of  the  interaction  of  the  radiation  with  the  gas.  This  interaction  is  a  complex 
process’^,  in  which  there  is  a  transfer  of  energy  from  photons  to  electrons  via  the 
photoelectric  effect,  Compton  scattering  and  pair  production,  and  electrons  impart 
energy  to  the  material  by  excitation,  ionization  and  elastic  scattering.  The  free  electrons 
are  attracted  to  the  anode  and  the  positive  ions  to  the  cathode.  The  charge  collected  by 
the  electrodes  may  be  allowed  to  accumulate  and  measured  as  a  voltage  pulse  (charge 
mode)  or  allowed  to  leak  away  continuously  in  the  form  of  an  output  current  (current 
mode). 

There  is,  of  course,  the  question  of  recombination  of  electrons  and  positive  ions 
to  form  neutral  atoms,  as  well  as  the  possibility  that  so.me  electrons  may  acquire 
enough  energy  to  cause  further  ionization.  These  depend  on  such  factors  as  the 
potential  difference  V  between  the  electrodes  and  the  pressure  of  the  gas.  Appreciable 
recombination  is  expected  to  take  place  at  small  V,  whereas  secondary  ionization 
occurs  for  large  values  of  V,  or  at  low  gas  pressure:.  We  shall  assume  that  the  potential 
difference  applied  is  large  enough  to  avoid  appreciable  recombination,  but  not  large 
enough  to  produce  secondary  ionization. 

Since  our  ultimate  objective  is  the  measurement  of  the  spectra  of  pulsed 
bremsstrahling,  we  shall  be  concerned  with  the  operation  of  the  ionization  chamber  in 
charge  mode.  If,  as  is  likely  to  be  the  case,  the  duration  of  the  pulse  (~  ns)  is  much 
less  than  the  time  of  transit  of  the  ions  across  the  ionization  chamber,  the  ionization 
may  be  treated  as  taking  place  instantaneously.  We  shall  assume,  for  the  time  being, 
that  the  ionization  chamber  is  operating  under  ideal  conditions.  This  implies  that  all 
the  ion  pairs  formed  in  the  chamber  are  collected  by  the  applied  field  at  the  electrodes, 
without  loss  by  recombination  or  augmentation  by  secondary  ionization.  The  total 
charge  (of  either  sign)  collected  is  then  given  by 

Q=ne=^e  (2.1) 

>v 

where  n  is  the  number  of  ion  pairs  produced,  €  is  the  electronic  charge,  E  is  the 
absorbed  energy  and  W  is  the  mean  energy  required  to  produce  an  ion  pair.  A 
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summary  of  W  values  for  a  number  of  different  gases,  for  several  ionizing  agents  has 
been  given  by  I.  T.  Myers^^  As  a  result  of  the  deposition  of  charge  on  the  electrodes, 
a  voltage  pulse  appears  at  the  output  end  of  the  detector.  This  is  illustrated  in  Fig.  2. 
Prior  to  the  introduction  of  the  radiation,  a  potential  difference  V  is  applied  through  a 
load  resistance  R  across  the  detector  plates,  assumed  to  have  a  capacitance  C.  Charges 
q  and  -q,  respectively,  are  deposited  on  the  negative  and  positive  plates,  causing  the 
potential  difference  across  them  to  fail  by  an  amount  v  =  q/C.  The  capacitor  is 

subsequently  recharged  by  the  power  supply,  with  time  constant  RC.  In  practice,  the 
signal  pulse  is  extracted  through  a  coupling  capacitance  Cg,  which  transmits  only  ac 

voltages,  thereby  isolating  it  from  the  much  larger  dc  component.  The  output  pulse 
then  has  the  form  shown  in  Fig.  2,  rising  linearly  to  its  maximum  height  v  and  then 
decaying  exponentially,  with  time  constant  RC. 

The  simple  theory  outlined  above  must  be  modified  in  light  of  certain  practical 
considerations.  The  effect  of  recombination  has  been  discussed  extensively  by  Boag‘“. 
For  pulsed  radiation,  it  has  been  shown  that  the  collection  efficiency  f  (the  ratio  of 
charge  collected  to  charge  produced )  is  given  by 


/  =-h<l+Mo)  ;Mo  =  - 


a 


Uq 


7P0 


(2.2) 


where  a  is  the  recombination  coefficient,  X',  and  the  mobilities  of  the  positive  and 
negative  ions,  respectively,  d  the  separation  between  the  plates  and  the  charge 
created  per  unit  volume.  For  an  accurate  measurement  of  the  total  ionization  produced, 
the  chamber  should  be  made  air-equivalent,  i.e.  the  chamber  wall  should  have  the 
same  photon  mass-energy  absorption  coefficient  and  the  same  mass  stopping  power  as 
air^  This  can  be  accomplished,  approximately,  by  choosing  the  wall  material  to  be  a 
compound  of  low-Z  materials,  such  as  nylon,  carbon,  silica  (lined  with  graphite  to 
provide  adequate  conduction),  i.e.  having  an  effective  Z  closely  matching  that  of  air. 
Practical  considerations,  such  as  ease  of  fabrication,  may  require  the  use  of  a  wall 
material  that  is  not  air-equivalent,  however.  Tanaka  et  al"  describe  a  parallel  plate 
ionization  chamber  in  which  Aluminium  is  chosen  as  the  wall  material  and  the  whole 
region  between  the  plates  is  the  sensitive  volume. 

Solid  State  Detectors 

The  operation  of  a  solid  state  detector  is  best  understood  in  terms  of  the  energy 
band  structure  of  the  material.  The  energy  levels  available  to  an  electron  in  a  ciystal 
form  a  set  of  allowed  bands  which  are  separated  by  forbidden  bands.  In  the  ground 
state  of  the  crystal,  there  may  be  several  filled  bands,  the  uppermost  of  v/hich  (the 
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valence  band)  is  separated  from  the  conduction  band  by  an  energy  gap  which  is 
typically  <  iO  eV  for  a  solid  insulator,  but  may  be  as  little  as  1  eV  for  a 
semiconductor.  Since  there  are  no  vacant  states  in  the  allowed  bands,  there  is  no 
n:igration  of  charge  within  the  crystal,  unless  electrons  are  given  enough  energy  to 
surmount  the  forbidden  energy  gap  and  reach  the  conduction  band.  This  energy  can  be 
supplied  by,  among  other  things,  ionizing  radiation.  Electrons  which  reach  the 
conduction  band  may  migrate  through  the  material  under  the  action  of  an  electric  field. 
The  elevation  of  an  electron  to  the  conduction  band  creates  a  vacancy  or  "hole"  in  the 
valence  band  which  may  be  filled  by  a  nearby  valence  electron,  moving  under  the 
action  of  an  electric  field.  The  whole  process  is  similar  to  the  creation  of  an  ion  pair  in 
a  gas. 

One  of  the  essential  features  of  an  ionization  chamber  is  that  application  of  a 
moderate  potential  difference  across  the  chamber  should  not  result  in  the  flow  of 
current.  The  use  of  a  solid  insulator,  in  lieu  of  the  gas  in  a  conventional  ion  chamber, 
has  two  distinct  advantages.  First,  the  greater  stopping  power  of  a  solid,  as  compared 
with  a  gas,  should  enhance  the  detection  efficiency  of  the  instrument.  Secondly,  the 
width  of  the  energy  gap  between  the  valence  band  and  the  conduction  band  in  a  solid  is 
typically  much  smaller  than  the  ionization  energy  of  a  gas;  hence  the  mean  energy 
required  to  form  an  ion  pair  is  much  smaller  in  a  solid  than  in  a  gas,  and  greater  energy 
resolution  should  result. 

The  parallel  plate  ionization  chamber  with  a  solid  insulator  suffers,  however, 
from  a  serious  drawback,  caused  by  the  presence  of  impurities  in  the  material.  If  the 
allowed  energy  states  of  these  impurities  are  situated  just  below  the  conduction  band  of 
the  host  material,  electrons  normally  resident  in  these  states  (donor  states)  are  quite 
easily  elevated  into  the  conduction  band  by  thermal  excitation.  This  creates  positive 
ions  in  fixed  positions  which  act  as  "traps"  for  electrons  from  the  signal  pulse.  On  the 
other  hand,  discrete  impurity  levels  may  be  located  just  above  the  valence  band,  and 
electrons  from  the  valence  band  may  be  thermally  excited  into  these  impurity  states 
(acceptor  states),  creating  negative  ions  which  can  trap  positive  ions  from  the  signal 
pulse.  Crystal  defects  have  a  similar  effect.  These  traps  represent  a  serious  departure 
from  idealized  ion  chamber  conditions,  which  can  only  be  realized  in  an  intrinsic 
material,  i.e.  one  which  is  absolutely  pure  and  perfect.  Silicon  and  germanium  are 
good  candidates,  but  they  are  semiconductors,  with  energy  gaps  of  the  order  of  1  eV 
between  the  valence  and  conduction  bands,  and  therefore  susceptible  to  leakage  current 
ai  low  applied  potential  differences  or  modest  temperatures. 
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Now  consider  a  single,  high  purity  crystal  (Si  or  Ge)  which  is  doped  on  one  side 
with  n-type  material  and  on  the  other  with  p-type  material.  The  crystal  will  contain 
some  free  electrons  in  the  conduction  band  and  some  free  holes  in  the  valence  band. 
Thus  the  n-type  material  will  contain  many  free  electrons  and  very  few  holes,  while  the 
p-type  material  will  contain  many  free  holes  but  very  few  electrons.  A  current  can  be 
made  to  flow  across  the  device  if  a  potential  difference  is  applied  in  such  a  direction  as 
to  attract  the  free  charges  toward  the  junction.  This  is  known  as  forward  bias.  On  the 
other  hand,  if  the  polarity  of  the  applied  potential  is  reversed  (reverse  bias),  a  region 
which  is  devoid  of  charge  will  be  established  in  the  vicinity  of  the  junction.  This 
region  is  known  as  the  depletion  region,  and  it  behaves  like  an  insulator,  while  the  n- 
and  p-type  regions  behave  like  conductors.  The  whole  arrangement  acts  like  an 
ionization  chamber,  with  the  depletion  region  being  the  sensitive  volume.  If  ionizing 
radiation  is  introduced  into  the  "chamber",  it  creates  ion  pairs  (free  elec*"ons  and 
holes)  in  the  depletion  region,  and  these  charges  migrate  to  the  electrodes  under  the 
action  of  the  applied  potential.  A  Silicon  p-n  junction  detector  is  shown  schematically 
in  Fig.  3.  If  all  of  these  charges  are  collected  (without  trapping  or  recombination),  the 
charge  pulse  is  given  by 

Q=ne=^e  (2.1) 

where  n  is  the  number  of  ion  pairs  formed,  e  is  the  electronic  charge,  W  is  the  mean 
energy  required  per  ion  pair  and  E  is  the  total  energy  deposited.  It  will  be  recalled  that 
an  identical  expression  was  obtained  for  the  gas  ionization  chamber. 

The  junction  diode  as  outlined  above  suffers  from  the  drawback,  already  alluded 
to,  that  electrons  may  be  thermally  excited  across  the  narrow  gap  between  the  valence 
and  conduction  bands,  producing  a  leakage  current  which  increases  with  the  applied 
(reverse  bias)  potential.  Another  source  of  leakage  current  is  the  presence  of  minority 
carriers,  p-type  impurities  in  the  n-type  material  and  vice  versa.  The  thickness  of  the 
depletion  region  is  proportional  to  -^pV  where  p  is  the  resistivity  of  Silicon  and  V 
is  the  applied  voltage.  This  thickness  determines  the  number  of  ion  pairs  formed, 
hence  the  detection  efficiency  of  the  device.  To  increase  the  thickness,  either  p  or  V 
must  be  increased.  Increasing  V  would  lead  to  insulation  breakdown  in  the  sensitive 
volume.  The  resistivity  of  Si  can  be  effectively  increased  by  diffusing  carefully 
controlled  amounts  of  I/3  ions  into  the  lattice  of  a  p-type  crystal  of  Si,  where  they 
form  neutral  pairs  with  the  negative  acceptor  ions.  The  increase  in  p  means  that  larger 
bias  voltages  can  be  applied  without  causing  electrical  breakdown,  and  the  thicloiess  of 
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the  depletion  layer  is  thereby  increased.  Junction  diodes  with  thick  depletion  layers 
produced  in  this  way  are  called  PIN  junctions. 

Consider  a  crystal  insulator  which  is  doped  with  a  selected  material  which 
introduces  impurity  levels  between  the  valence  band  and  the  conduction  band. 
Irradiation  of  the  crystal  creates  free  electrons  which  are  elevated  to  the  conduction 
band,  where  they  //ander  about  in  the  solid  until  they  either  return  to  the  valence  band 
or  are  trapped  at  tb?  impurity  centers.  The  impurity  acts  like  an  activator,  in  the  sense 
that  the  electrons  remain  trapped  until  they  are  subsequently  released  by  some  kind  of 
excitation.  If  the  release  is  caused  by  heating,  thermoluminescence  is  said  to  occur. 
The  trapped  electrons  are  excited  up  to  the  conduction  band,  whence  they  return  to  the 
valence  band  with  the  emission  of  light.  This  process  is  shown  schematically  in  Fig.  4. 
The  amount  of  light  emitted  is  proportional  to  the  number  of  electrons  released  from 
the  traps,  which  is  in  turn  proportional  to  the  absorbed  dose.  Thermoluminescence  is  a 
property  of  most  solids,  at  least  to  some  degree.  To  be  useful  as  a  radiation  detector, 
however,  a  thermoluminescent  phosphor  meat  have  a  strong  light  output,  and  retain 
the  trapped  electrons  for  some  time  at  th<;  temperature  at  which  it  is  to  be  used. 

Several  materials  exhibit  these  properties,  including  manganese  activated  Calcium 
Sulphate  (CaS04:Mn),  manganese  activated  Calcium  Fluoride  (CaF2:Mn)  and  Lithium 

Fluoride.  These  phosphors  are  reviewed  extensively  by  J.  F.  Fowler  and  F.  H.  Attix**. 
Instrumentation  needed  for  readout  of  the  TLD  is  also  discussed. 

Scintillator  Counters 

By  contrast,  there  are  many  solid  .^d  liquid  substances  which  possess  the 
property  of  instantaneous  luminescence,  or  fluorescence.  The  incident  radiation  raises 
a  number  of  electrons  to  excited  energy  slUes,  from  which  they  decay  with  the 
emission  of  flashes  of  light,  or  scintillations.  Commonly  used  scintillators  include 
organic  and  inorganic  crystals,  plastics,  glasses,  liquids  and  gases.  The  pulse  of  light 
emitted  by  a  scintillator  is  typically  very  weak,  and  must  be  amplified  by  a  photo¬ 
sensitive  device,  such  as  a  photomultiplier.  The  light  is  emitted  isotropically  by  the 
scintillator  and  must  be  channelled  towards  the  photomultiplier  by  an  optical  coupling 
device,  or  light  pipe.  The  scintillator,  optical  coupling  and  photomultiplier  together 
constitute  the  radiation  detection  system,  the  scintillator  being  the  sensitive  volume. 
The  whole  assembly  is  enclosed  in  a  light-tight  container  and  the  radiation  is  introduced 
through  an  entrance  window.  An  extensive  survey  of  scintillation  detectors  has  been 
given  by  W.  J.  Ramm‘’. 
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m.  DETECTION  SYSTEMS 


Differential  Absorption  Spectrometers  (DAS) 

In  designing  an  X-ray  Spectrometer,  the  basic  idea  is  to  "sample"  the  spectrum 
at  different  points,  using  a  number  of  different  detectors.  One  of  the  limitations  of  the 
foil  activation  technique  is  that  the  spectrum  can  be  sampled  at  relatively  few  points 
(these  being  determined  by  the  activation  levels  of  the  nuclei  involved)  and  there  is 
only  a  small  number  of  isotopes  which  are  good  candidates  for  this  procedure.  An 
alternative  approach  is  to  use  a  large  number  of  identical  detectors,  and  to  interpose 
different  thicknesses  of  different  absorbing  materials  between  them  and  the  source  of 
the  radiation.  Each  detector  then  exhibits  a  different  spectral  response,  and  the 
spectrum  can  be  unfolded  once  the  detector  response  functions  are  known.  The 
response  of  the  i^h  detector  is,  from  Equation  (1.2), 

(3.1) 

The  index  i  runs  from  1  to  n,  where  n  is  the  number  of  detectors. 
Equation  (3.1)  represents  a  system  of  n  integral  equations,  which  cannot  be  solved  for 
arbitrary  M^  and/?(^).  To  overcome  this  difficulty,  the  spectrum  is  divided  into  m 
energy  "bins",  of  finite  width,  and  equation  (3.1)  then  becomes 

(3.2) 

r=l 

where  Ae,.  is  the  width  of  the  r^^  energy  bin. 

The  response  functions  may  be  calculated^  if  the  mass  energy  absorption 
coefficients  of  the  absorbers  and  the  detector  material  are  known.  If  an  absorber  of 
thickness  xj  and  density  pj  is  interposed  between  the  source  and  the  i^^  detector,  the 

fluence  F(e)  of  the  incident  radiation  is  attenuated  by  a  factor 


exp[-(/i„ Subsequently,  a  fraction 

energy  striking  the  detector  is  absorbed  by  it,  where  the  subscript  D  refers  to  the 
detector.  Thus,  assuming  that  the  measured  output  of  the  detector  is  proportional  to  the 
absorbed  energy,  we  have 


■ 

1-exp 

- 

Pl^d 

» 

<  P  / 

D 

of  the 


dA/j  =  constant  .exp[-(Ata, /p),(e)PiA:,  ] 


1 

r  , 

■ 

“ 

l-exm 

I 

1 

Pm 

Pc^D 

► 

1 

[ 

<  P  > 

D 

F^)Ae  (3.3) 


24-9 


whence  the  response  function  can  be  i)und  by  comparison  with  equation  (1.1).  It  is 
assumed  in  the  above  discussion  “*  that  there  is  electronic  equilibrium  between  the 
absorber  and  the  detector  (i.e.  their  atomic  numbers  are  approximately  equal),  so  that 
the  loss  of  electrons  which  leave  the  detector  without  being  absorbed  is  balanced  by  a 
an  equal  gain  of  electrons  entering  the  detector  from  the  absorber.  Where  electronic 
equilibrium  does  not  exist,  the  response  functions  must  be  found  from  a  full  Monte 
Carlo  radiation  transport  computation.  The  transport  codes  TIGERS  or  the  discrete 
ordinates  code  CEPXS-ONETRAN*®  can  be  used  for  this  purpose. 

A  Differential  Absorption  Spectrometer  must  be  used  in  conjunction  with  an 
unfolding  code  or  technique  which  allows  the  spectral  fluence  F(e)  to  be  calculated 
from  Equation  (3.1).  A  computer  code  named  YOGI*’  has  been  developed  by  T.  L. 
Johnson  and  S.  G.  Gorbics  and  used  in  the  unfolding  process.  A  trial  spectrum  and  the 

known  response  functions  are  input  into  the  program,  which  yields  calculated  values  of 
the  detector  response.  These  are  compared  with  the  measured  values  Mj  and  a  root 

mean  square  deviation  is  determined.  The  code  then  perturbs  the  trial  spectrum  at 
some  point  by  a  fixed  amount  and  a  new  rms  deviation  is  found.  The  perturbed 
spectrum  is  retained  if  it  yields  a  better  fit  (smaller  rms  error),  otherwise  it  is 
discarded.  The  procedure  is  repeated  for  randomly  chosen  points  until  some  pre¬ 
determined  level  of  accuracy  is  reached  or  no  further  reductions  in  error  can  be 
achieved.  It  should  be  noted  that  Equation  (3.1)  represents  a  set  of  n  equations  in  m 
unknowns.  Thus  if  n  is  less  than  m,  the  problem  is  underdetermined  and  an  infinite 
number  of  solutions  is  possible.  In  such  a  case,  constraints  must  be  imposed  at  each 
perturbation  in  the  unfolding  routine,  to  ensure  that  only  physically  acceptable  solutions 
are  considered.  A  "realistic"  trial  spectrum  must  be  used,  negative  values  should  be 
rejected  and  the  solution  should  be  continuous  except  for  the  K-line  in  a  specific  energy 
bin. 

A  differential  absorption  spectrometer  has  been  developed  and  used  by  G.  A 
Carlson  and  L.  J.  Lorence^  on  various  bremsstrahlung  sources.  The  arrangement 

utilized  a  stack  of  thirteen  flat  metal  absorbers  (2  of  Al,  5  of  Cu  and  6  of  a  90% 
Tungsten  alloy),  separated  by  TLD  arrays.  Each  array  consisted  of  four  CaF2:Mn 

TLD's  placed  in  an  Aluminum  tray  which  was  inserted  into  the  DAS.  The  DAS  was 
fielded  at  a  distance  of  at  least  30  cm  from  the  source,  without  collimation  or 
shielding.The  spectrum  was  divided  into  thirty  energy  bins,  equally  spaced  on  a 
logarithmic  scale.  The  unfolded  spectra  were  found  to  be  in  reasonable  agreement  with 
TIGERP  predictions. 
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A  DAS  which  incorporates  a  slightly  different  design  has  been  developed  by  S. 
G.  Gorbics  and  N.  R.  Pereira^®.  This  uses  an  array  of  spherical  absorbers,  each  with  a 
TLD  embedded  at  its  centre.  This  has  the  advantage  that  the  absorption  distance  is 
accurately  known  (the  radius  of  the  sphere),  irrespective  of  the  direction  of  the  incident 
radiation. 

Compton  Spectrometers 

As  mentioned  previously,  when  X-radiation  interacts  with  matter,  it  may  impart 
energy  to  electrons  via  the  photoelectric  effect,  Compton  scattering  and  pair 
production.  The  relative  importance  of  these  mechanisms'®,  for  various  atomic 
numbers  Z  and  photoj.  jrgies,  is  shown  in  Fig.  5.  For  low  Z  materials  and  incident 
photon  energies  in  the  range  of  interest  (100  kev  to  2.0  Mev),  Compton  scattering  is 
clearly  the  dominant  process.  Pair  production  cannot  occur  at  all  if  the  photon  energy 
is  less  than  the  rest  energy  of  the  electron-positron  pair  (1.02  Mev)  and  is  not 
significant  in  low  Z  materials  until  much  higher  energy.  Photoemission  can  also  be 
ruled  out  if  the  detector  is  designed  to  reject  low-energy  electrons.  Compton  scattering 
thus  becomes  an  attractive  mechanism  for  the  design  of  an  X-ray  spectrometer.  A 
collimated  beam  of  radiation  is  passed  through  a  magnetic  field  to  remove  any  electrons 
which  may  be  present  and  then  impinges  on  a  thin,  low  Z  scattering  foil.  The 
extraction  of  the  beam  of  recoil  electrons  and  measurement  of  the  signal  can  be 
performed  in  either  of  two  ways. 

The  magnetic  Compton  Spectrometer  is  a  device  in  which  the  electrons  scattered 
in  some  pre-determined  direction  are  collimated  and  their  energy  determined  by  using 
the  Klein-Nishina  cross-section  formula  for  Compton  scattering  and  making  corrections 
for  deviations  in  the  path  of  the  electron  and  loss  of  energy  due  to  multiple  scattering  as 
it  passes  through  the  foil.  This  technique  has  the  disadvantage  of  being  very 
inefficient,  howe  'er.  Only  a  narrowly  collimated  fraction  of  the  electron  emission  is 
meant  to  be  detected  and  extensive  shielding  is  required.  Nonetheless,  this  technique 
has  been  used  to  measure  the  spectrum  of  a  10  Mev  flash  X-ray  generator^'.  The  Time 
Projection  Compton  Spectrometer  (TPCS)^’,  developed  at  SANDIA,  is  an 
alternative  device  in  which  the  electrons  emitted  from  the  converter  foil  migrate 
towards  the  detector  in  a  narrow,  evacuated  drift  tube  with  an  axial  current-carrying 
wire.  The  magnetic  field  created  is  inversely  proportional  to  the  distance  from  the  wire 
and  the  electrons  experience  a  net  drift  along  the  axis  (in  the  opposite  direction  to  the 
current)  with  a  drift  velocity  (see  Appendix )  given  by 
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=|f(l+co^0>  (3.4) 

,  Tffymv  ^  „ 

where  £= — - —  (3.5) 

fh 

V  is  the  (constant)  speed  of  the  electrmi,  I  is  the  current,  e  and  m  are  the  charge  and 

ntass,  respectively,  of  the  electron  ,  7=11 — j\  and  0  is  the  angle  which  the  initial 

velocity  of  the  election  makes  with  the  magnetic  field. 

A  schematic  diagram  of  the  ITCS  is  shown  in  I^g.  6.  The  foil  is  fixed  in  a 
radial  plane  of  the  drift  tube  and  is  irradiated  at  normal  incidence  by  a  collimated  beam 
of  X-rays.  In  order  to  maximize  the  signal,  the  thickness  and  the  cross-section  of  the 
foil  should  be  made  as  large  as  possible.  However,  the  extent  to  which  this  can  be 
Is  limited  by  certain  practical  considerations.  Multiple  scattering  within  the  foil 
changes  both  the  direction  and  energy  of  the  elecaons,  and  therefore  distorts  the  energy 
spectrum.  This  places  a  limit  on  the  thickness  of  the  foil.  The  axial  dimension  of  the 
foil  must  be  less  than  the  pitch  of  the  helical  orbit,  otherwise  the  electron  would  collide 
with  the  foil  after  the  first  orbit.  The  radial  dimension  of  the  foil  should  be  equal  to 
that  of  the  detector  (a  plastic  scinti’*ator,  coupled  to  a  fast  photomultiplier  tube). 

The  TPCS  differs  from  the  DAS  in  that  there  is  '>nly  one  detector,  and  that  the 
required  photon  spectrum  is  mapoed  into  a  time  domain  by  the  time  of  flight 
measurements.  Thus  the  response  function  Rd^f)  is  now  a  function  of  both  energy  and 
time.  The  signal  S  <t )  measured  by  the  detector  at  time  t  is  given  by 

(3.6) 

where  U^yie  is  the  number  of  photons  whose  energies  lie  in  the  range  e  to  e  +de .  In 
oractice,  the  energy  spectrum  is  divided  into  a  number  of  energy  bins  of  finite  width 
Aej ,  so  that  Equation  (3.6)  is  replaced  by 

(3.7) 

J=l 

where  m  is  the  number  of  energy  bins.  The  lesponse  function  /?^;)  is  computed  by 
using  a  Monte  Carlo  simulation  of  the  X-ray  interaction.  The  number  of  photons  per 
unit  area  with  energy  6  which  strike  the  foil  and  are  scattered  through  the  (polar)  angle 


24-12 


6  (cos0==fi)  is  ^en  in  tenns  of  the  Kldn-Nidiina  differential  cioss-secti(»i  da^,fL) 
by 

dN  =U^yiJd(j(e,fi^  (3.8) 

vdieieR^  is  the  number  of  electrons  per  unit  volume  and  T  is  the  LhrVness  of  the  foil. 
The  kinetic  energy  and  diiectitm  of  the  recoil  electnm  are  obtained  by  flying 
conservation  of  energy  and  momentum.  The  azimuthal  angles  of  the  electron  directitHi 
are  assumed  to  be  distributed  uniformly  between  0  and  2t.  We  thus  obtain  the  number 
of  electrons  which  are  enutted  with  a  pven  speed  v  in  a  ^ven  direction.  From 
Equation  (3.4),  wfaidi  ^es  the  drift  velocity  of  the  electrcms,  the  number  of  electrons 
with  transit  time  t  produced  by  X-rays  of  energy  e  is  computed.  This  number  is 
clearly  proportional  to  dN  (see  Equation  (3.8),  and  hence  the  measured  signal  satisfies 
tiie  relation 


(3.9) 

dSi)=R^fV^¥e 

(3.10) 

where R^f)  is  the  response  function. 

The  unfolding  of  the  spectrum,  from  Equation  (3.7),  consists  of  a  least  squares 
fit  to  the  measured  signal  S(t),  using  a  trial  spectrum  N'(ei()Ae]f.  A  variational 

principle  is  applied  which  minimizes  the  square  of  the  difference  between  the  measured 
signal  S(t)  and  the  signal  calculated  by  using  the  trial  spectrum.  This  yields^ 

(3.11) 

/=! 

which  may  be  solved  by  maUi^s  methods  to  detennine  the  unknown  spectrum  U(e). 

IV.  DISCUSSION 

We  have  examined  a  number  of  X-ray  Spectrometers,  which  utilize  a  variety  of 
radiation  detectors  and  unfolding  techniques.  The  photoactivation  spectrometer  is 
limited  by  the  fact  that  it  samples  the  X-ray  spectrum  only  at  the  energies  of  the 
activation  levels  and  that  there  is  a  relatively  small  number  of  nuclei  which  are  good 
candidates  for  this  procedure.  It  also  suffers  from  a  lack  of  complete  and  reliable  data 
on  the  nuclear  parameters  of  interest.  Of  the  detectors  considered,  the  TLD  is  perhaps 
the  most  popular  because  it  is  small,  inexpensive,  available  in  a  wide  range  of  materials 
and  dose  ranges,  requires  no  instrumentation  during  irradiation  and  retains  accurate 


dose  infonnalion  for  a  l(Mig  time.  The  ionization  chamber,  by  contrast,  would  appear 
to  require  extennve  testing  to  determine  the  optimum  choices  of  its  size  and  shape, 
electrode  material,  type  of  gas  and  pressure,  etc.  The  absorbers  used  in  the  Differential 
AbsorpticHi  Spectrometer  do  not  show  sufficient  variation  in  their  absorption 
characteristics  to  provide  adequate  energy  resolution  above  about  800  kev,  and  their  use 
is  thus  effectively  limited  to  the  low  energy  r^m  The  Time  Projection  Compton 
Spectrometer,  on  the  other  hand,  can  be  used  to  ^  ffect  throughout  the  r^on  of 

interest  (0.1  Mev  to  2.0  Mev).  It  is,  however,  cui  .ne  and  expensive  to  field  and 

practical  considerations  may  well  restrict  its  use  to  the  high  energy  end  of  the  X-ray 
spectrum. 

APPENDIX 

The  follomng  is  adapted  from  a  Report  by  J.  R.  Lee^^  of  Sandia  National 
Laboratories.  Consider  the  motion  of  a  relativistic  electron  in  the  magnetic  field  due 
to  a  current  I  flowing  in  a  long  straight  wire  in  the  negative  z-direction.  In  a 
cylindrical  coordinate  system  (s,(i>,z)  the  magnetic  field  is  given  by 


g  =  -ML^ 
2ns 

and  the  equation  of  motion  of  the  electron  is 


(A.1) 


(A.2) 


where 


(A.3) 


Since  the  magnetic  field  does  no  work  on  the  electron  v  =0),  the  speed  v  is 
constant  and  the  equation  of  motion  becomes 


dv 

dt 


J^Ie_ 
^2ns  ym 


(A.4) 


u  Ic 

The  constant  — —  has  the  dimensions  of  velocity,  and  it  will  be  convenient  to 
Tnym 

introduce  the  dimensionless  constant 


2/r  ymv 
Mo  te 

and  write  Equation  (A.4)  in  the  form 


(A.5) 
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dt  £5 '  ’ 

The  components  v, ,  v  ^  and  v.  satisfy  the  equations 


1  2  V 

V - y  = - , 

s'  £S 


V.  = — V, 


(A-6) 


(A.7a) 


(A-Tb) 


(A.7c) 


Combining  Equations  (A.Tb)  and  (A-7c),  we  see  that  j|  =  0,  i.e. 

is  a  constant  of  the  motion.  Similarly,  it  follows  from  Equation  {A.7c)  that 


(A.8) 


,=.ex{-4.] 


(A.9) 


is  a  constant  of  the  motion.  Multiplying  (A.8)  and  (A.9),  we  see  that 

A 

sv^(and  hence  scos0)  is  also  a  constant  of  the  motion. 


The  decomposition  of  the  velocity  vector 
V  into  its  components  v, ,  v  ^  and  is  shown 
in  Fig.  7,  where 

v^=vcos0  (A.  10a) 

v^=vsin9sina  (A.  10b)  £ 

Vj  =v  sin0cosa  (A.10c) 


Ffg.  7.  Decomposition  of  eiectron  veiocity 

Substituting  Equations  (A.  10)  into  Equations 

(A.7),  we  obtain  a  pair  of  ioapled  equations  in  the  two  unknowns  9  and  a: 


V 

0  =  -sinacos0 


(A.  11a) 


a  = - [I-£cosacos9cot<?] 

es 


(A.  11b) 
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<to 


The  drift  velocity  v^,  of  the  electron  is  taken  to  be  the  average  ofv.  over  one  orbit  (27C 
increment  in  a).  We  may  thus  write  =Z  IT ,  where 


„  an^cosaexrfesinflcoscs] ,  _  extfesm0cosal  . 

Z  -  bSq} - ; - ;  r  =— r — - —da  (A.  12) 

0^  .  - y  ^l-cosacos0cot5  ^  ^ 


^  l-cosaccs5cot0 


Aft^  a  lengthy  calculation,  power  series  solutions  in  e  for  Z  and  T  are  obtained.  To 
first  order  in  £,  this  gives 


(A.  13) 

It  is  estimated^^  that  for  e  =0.05,  the  maximum  error  is  computing  the  drift  velocity 
is  less  than  1%. 
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Signal  pulse 


Fig.  1.  Basic  structure  of  an  ionization  chamber  (from  Ref.  9). 


Fig.  2.  Output  pulse  from  detector  circuit, 
(a)  d.c,  coupled  (b)  a.c.  coupled  (from  Ref.  9). 
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Fig.  3.  Silicon  p-n  junction  radiation  detector  (from  Ref.  14). 
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Heating 


Fig.  4.  The  thermoluminescence  process  (from  Ref.  9). 
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Fig.  5.  Relative  importance  of  x-ray  interactions  (from  Ref.  16). 


Collimated  x-ray 


Fig.  6.  Schematic  representation  of  the  Time  Projection  Comption  Spectrometer 

(from  ref.  6) 
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THE  EFFECT  OF  CARBON  PARTICLE  COMBUSTION  ON  THE 
INFRARED  SIGNATURE  OF  A  MAGNESIUM-FLUOROCARBON  FLARE 

Dr.  OUn  Perry  Norton 

Abstract 

A  burning  magnesIum-Teflon  flare  produces  a  plume  which  contains  a  substantial 
quantity  of  solid  carbon  particles.  As  the  flare  products  mix  with  the  surrounding  air, 
these  carbon  particles  will  bum.  The  combustion  of  carbon  particles  has  been  Incorporated 
Into  SMIRF  (Signature  Model  for  JnfraRed  Flares),  an  existing  model  for  predicting  the 
plume  structure  and  signature  of  these  flares.  It  Is  shown  that  the  inclusion  of  carbon 
particle  combustion  has  a  significant  effect  on  the  predicted  plume  signatures. 
Introdu.cttQn 

Teflon-magnesium  flares  are  used  as  Infrared  decoj^s.  Thus,  the  accurate  prediction 
of  their  Infrared  signatures  Is  of  considerable  Interest.  A  computer  model  for  the  prediction 
of  flare  signatures,  SMIRF  (Signature  Model  for  Infrared  Elares),  has  been  developed  at 
Arnold  Engineering  Development  Center.*  This  code  uses  the  existing  JANNAF  codes, 
originally  developed  for  rocket  plume  signature  prediction,  and  modifies  them  to  solve  the 
related  problem  of  predicting  the  flowfleld  and  Infrared  signature  In  the  plume  (or  wake) 
of  a  Teflon-magnesium  flare. 

Throughout  this  report,  results  will  be  cited  for  a  "typical"  flare.  This  Is  a  hypothetical 
flare  whose  characteristics  are  representative  of  actual  flares.  For  this  report,  this  generic 
flare  Is  a  recteingular  parallelplped  which  measures  2x3x5  inches  and  has  a  mass  of  1,4 
Ibm.  The  flare  composition  Is  54%  (by  mass)  Mg,  30%  C2F4,  and  16%  C2,2iF'2.84Hi.58'  'Tbfs 
solid  bums  at  a  rate  of  0. 144  inches/second,  giving  the  flare  a  lifetime  of  6.9  seconds  from 
Ignition  to  burnout.  For  the  calculations  to  be  presented  in  this  paper,  the  flare  Is  Ignited 
and  ejected  from  an  aircraft  at  an  altitude  of  9097  feet,  and  has  an  Initial  velocity  of  422 
fl/sec  horizontally  and  -15  ft/sec  vertically. 
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This  flare  Is  extremely  fuel  rich.  (The  equivalence  ratio  Is  5.)  When  It  bums,  the 
resulting  combustion  products  contain  several  species  that  will  undergo  further 
combustion  when  mixed  with  air.  The  NASA  chemical  equilibrium  code*  has  been  used  to 
predict  the  composition  of  the  combustion  products  of  this  typical  flare.  The  results  are 
given  in  Table  1. 


Table  1.  Flare  Combustion  Products 

Composition 

Mg(gas) 

38.8%  mole  fraction 

C(graphlte) 

30.2% 

MgF2(llquld) 

24.2% 

H2(gas) 

4.5% 

MgF(gas) 

1.5% 

Remainder 

<  1% 

Temperature 

1913  K 

Thus,  the  products  from  the  combustion  of  the  solid  flare  material  contain  significant 
quantities  of  magnesium  vapor  and  solid  carbon  (graphite)  particles,  both  of  which  will 
bum  as  the  plume  mixes  with  the  surrounding  atmosphere.  Further  calculations  were 
made  with  the  equilibrium  code  which  demonstrated  that  mixtures  of  flare  products  with 
air  will  reach  temperatures  above  2900  K.  This  is  a  significant  Increase  above  the  1913  K 
temperature  of  the  flare  product  itself  and  demonstrates  the  potential  influence  of 
afterburning  on  the  temperature  distribution. 

The  solid  carbon  particles  will  be  significant  emitters  of  Infrared  radiation,  so  the 
accurate  prediction  of  the  distribution  of  these  particles  will  be  Important  In  predicting 
the  plume  signature.  Furthermore,  the  combustion  of  the  carbon  particles  Is  the  only 
possible  source  of  CO  and  CO2.  gases  which  are  Important  emitters  of  Infrared  Information. 
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Based  on  these  considerations,  an  effort  was  begun  to  accurately  model  the 
combustion  of  carbon  particles  in  the  plume  of  a  Teflon-magnesium  flare.  In  previous 
work,  a  literature  survey  was  undertaken  to  identify  the  species  in  the  plume  which  react 
with  solid  carbon.®’*  These  species  were  identified  as  the  molecules  Oj,  CO2,  and  HjO,  and 
the  radicals  OH.  O.  and  H.  Rates  were  found  for  the  reactions  of  these  species  with  solid 
carbon  surfaces.  These  rates  are  reprinted  in  Table  2. 

Table  2.  Kinetic  Rates  for  Reactivity  of  Carbon  Surface  with  Gases 

Oa  (0o,ch«/(12  g/mole)  =  (k.,Po/(1  +  ’fs^o/1  "X) 

where 

X=(l  +  MKfl/’o^)-' 

=  (20  moles/(cm^  ■  sec  •  atm))  exp(-15100/r) 

=  (4.46  X 10'^  nioles/(cm^  •  sec  •  atm))  exp(-7640/7') 

K;.  =  (1.51  X 10“’  moles/(cm^  •  sec))  exp(-48800/r) 

K2  =  (21.3  atm"')  exp(2060/r) 

CO2  Wcojchexi  =  (247  g/(cm’  •  sec  ■  atm))  Pcoj  exp(-21 100/r) 

H2O  %jOch«.  =  (247  g/(cm’ •  sec  •  atm))  exp(-21 100/7) 

OH  =  (0.28)  (12  g/mole)  Pon  V/fr/2n(17  g/mole) 

O  (0.50)  (12  g/mole)  Po  V/f7/27i(16  g/mole) 

H  %ciiOT  =  (0.036)  (12  g/mo!e)  '^KTI2n{l  g/mole) 

Is  the  chemlcal  reaction  rate  at  which  species  X  removes  carbon  from  the  sur¬ 
face  in  units  of  grams  per  second  per  square  centimeter  of  surface. 

Px  is  the  partial  pressure  of  species  X  in  atmospheres. 

7  is  the  surface  temperature  in  degrees  Kelvin. 

Px  denotes  the  concentration  of  species  X  in  moles  per  unit  volume. 

K  is  Boltzmann’s  constant. 


When  these  rate  expressions  were  evaluated  under  typical  plume  conditions  the 
species  CO2,  OH,  and  O  were  the  most  important  reactants  with  the  solid  carbon.  It  was 
found  that,  for  particles  less  than  10  microns  In  diameter,  the  particle  combustion  was 
klnetlcally  rather  than  diffusion  limited,  so  that  the  reactant  concentration  at  the  particle 
surface  could  be  approximated  by  its  bulk  concentration.  Furthermore,  any  temperature 
difference  between  the  gas  and  the  particles  decreases  as  the  particle  size  Is  reduced,  so 
that  small  particles  can  be  regarded  as  being  at  the  same  temperature  as  the  gas. 

References  3  and  4  contain  more  complete  descriptions  of  the  previous  work  on  this 
problem.  Including  bibliographical  references  for  the  reaction  rates  In  Table  2.  The  present 
work  Is  a  continuation  of  this  previous  work  on  carbon  particle  combustion,  and  consists 
of  Incorporating  these  carbon  particle  reactions  Into  the  SMIRF  flare  model  and  demon¬ 
strating  their  effects  on  the  predicted  signature. 

Discussion 

The  objective  is  to  Include  reactions  for  solid  carbon  particles  In  the  existing  SMIRF 
flare  model.  To  calculate  the  structure  of  a  reacting  plume,  SMIRF  Includes  a  modified 
version  of  SPF,  the  standard  JANNAF  plume  model.®  Thus,  the  problem  becomes  one  of 
Including  a  rudimentary  description  of  reacting  particles  In  SPF. 

The  difficulty  posed  Is  this:  although  SPF  does  have  the  ability  to  Include  some 
particle  effects,  this  ability  Is  restricted  to  nonreacting  particles  of  aluminum  oxide.  The 
solid  species  carbon  will  be  treated  as  If  it  were  a  gas,  in  that  SPF  will  compute  a  "con¬ 
centration"  of  solid  carbon  (so  many  moles  of  solid  carbon  per  unit  volume)  at  each  grid 
point.  This  pseudo-gas  approximation  creates  difficulties  when  calculating  chemical 
reaction  rates  involving  the  particulate  species.  The  reaction  types  Included  In  SPF  are 
valid  for  chemically  reacting  gaseous  species.  For  example,  for  a  reaction  between  two 
gaseous  species  called  A  arid  B,  the  reaction  rate  Is  written  In  terms  of  the  concentrations 
of  the  two  species: 

Pa  Pb  (1) 
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where  k  Is  the  rate  constant,  which  r  lay  be  a  function  of  temperature,  and  and  pa 

represent  the  concentrations  of  species  A  and  B.  Depending  on  the  units  chosen  for  k, 
these  concentrations  are  usually  expressed  as  moles  per  cubic  centimeter  or  grams  per 
cubic  centimeter. 

The  reaction  rate  should  be  calculated  differently  if  one  species  (B  for  example)  is 
in  the  form  of  a  particle  or  droplet.  Assuming  that  the  reactions  are  klnetlcally  limited,  the 
rate  of  the  reaction  is  proportional  to  the  exposed  surface  area  of  species  B: 

^  Pa  (^) 

where  o,  is  the  total  exposed  surface  area  of  the  particles  of  species  B  per  unit  volume  and 

k'  Is  the  surface  rate  constant. 

Uniform  Particle  Diameter  ApproxlmaUon 

The  simplest  model  for  the  particle  size  distribution  of  the  solid  carbon  particles  Is 
to  assume  that  all  particles  are  spheres  of  uniform  diameter.  Letting  ti  be  the  number  of 
carbon  particles  per  unit  volume  and  d  stand  for  their  diameter,  then  the  pseudo-gas 
concentration  of  carbon  is 

Pc  ^  I  Pc  n  (3) 

where  pc  Is  the  density  of  solid  carbon,  approximately  0.17  moles  per  cubic  centimeter 

(2. 1  grams  per  cm^).  Note  that  this  Is  different  from  the  pseudo-gas  concentration  (or 
density)  pc,  which  Is  the  amount  of  carbon  found  In  one  cubic  centimeter  of  the  particle 
laden  gas.  The  total  carbon  surface  area  per  unit  volume,  Oc,  Is  given  by: 

Oc  =  n  Ti  .  (4) 

Equations  3  emd  4  can  be  used  to  find  Oc,  the  carbon  surface  area  per  unit  volume. 
In  terms  of  the  pseudo-gas  concentration  of  carbon,  pc  and  the  particle  diameter: 


Ifareasonable  estimate  can  be  made  for  the  diameter  of  the  solid  carbon  paitides. 
then  Equation  5  can  be  used  to  convert  the  pseudo-gas  carbon  concentration  into  the 
reacting  surface  area  per  unit  volume.  Next,  suppose  that  the  reactionrate  for  solid  carbon 
is  given  in  the  form  of  Equation  2.  with  carbon  in  the  place  of  the  prevlousfy^  hypothetical 
solid  reactant  B.  Let  the  gaseous  reactant  continue  to  be  be  denoted  \jy  A.  with  the 
understanding  that  A  will  eventually  staru!  for  either  COj.  OH.  or  O.  Then,  the  reaction 
rate  is 


A  slight  rearrangement  of  Equation  6  yields: 

(S)  -■  ►  ■ 


Equation  7  is  identical  to  Equation  1 .  provided  that  the  effective  reaction  rate  for  the  carbon 
pseudo-gas  is  set  equal  to 


k 


(8) 


This  reaction  is  in  a  form  that  can  used  by  SPF. 

A  degree  of  approximation  is  involved.  In  reality,  the  particle  diameter  will  decrease 
as  the  particles  are  consumed,  and  thus  the  particle  dlameterwlll  not  be  constant  However, 
the  rates  computed  by  this  method  should  at  least  be  the  right  order  of  magnitude. 
Log-Normal  Particle  Size  Distribution 

Suppose  the  carbon  particle  diamete?s  are  not  all  the  same,  but  are  statistically 
distributed.  The  parucle  size  distribution  function  is  defined  In  the  same  way  as  a  prob¬ 
ability  distribution  function:  the  fraction  of  tlie  total  number  of  particles  with  diameters 
between  d^  and  <4  Is  equal  to  the  Integral  of  6(d)  from  d,  to  d^.  Tliere  are  several  standard 
distribution  function  which  are  corrunonly  used  to  describe  particles.  One  of  the  most 
common  Is  the  log-normal.  This  distribution  function  can  be  written  in  the  form: 


m 


cxp(-9/45’) 


exp(-5V^) 


(9) 
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where 


y  =  MdIdJ  .  {10) 

Here  <4  S  are  pajamete*^  that  describe  the  distribution  function,  corresponding  to  a 

characteristic  diamrter  and  a  standard  deviation,  respective^. 

Assuming  a  log-normal  particle  size  distribution,  equations  can  be  derived,  corre- 
spondiiigtothe  onesinthe  prevloussection.  toyield  anelTectiv^reactlonrate.  The  analogue 
(^Equation  3  is: 

Pc  =  I  Pc  n  d’  exp(-9/45^  (11) 

and.  in  place  of  Equation  4: 

Oc  =  n  r\  dl  exp(-2/5^  .  (12) 

Then,  corresponding  to  Equation  8. 

k  =  exp(l/4S*)  .  (13) 

Equation  13.  which  was  derived  assuming  a  log-normal  distribution,  is  nearly  the  same 
as  Equation  8.  which  was  derived  assuming  that  all  particles  are  the  same  diameter.  The 
onty  differences  are  that  the  "mean”  diameter  d„  appears  in  place  of  the  particle  diameter, 
and  the  appearance  of  the  additional  multiplying  factor  exp{l/45^. 

Results  of  Calculations 

t  ^ore  evaluating  the  effects  of  cart  on  particle  combustion,  a  few  additional  required 
reactions  were  added.  SMIRF  uses  SPF  to  perform  the  reacting  plume  calculation,  and 
SPF  gets  its  reaction  rates  from  an  input  file  called  DATBANK.  Althought  the  entire 
DATBANK  file  will  not  be  reproduced.  Table  3  contains  an  excerpt  from  the  DATBANK  file 
which  shows  the  reactions  of  interest. 

When  these  calculations  were  first  begun,  the  magnesium  vapor  in  the  flare  plume 
was  not  burning  as  it  mixed  with  the  oxygen  In  the  air.  Thus,  reaction  number  168  was 
added.  It  was  also  necessary  to  add  the  condensation  reactions  172  and  173.  The  rates 
assignee  to  these  reactions  are  not  measured  reaction  rates;  the  numbers  were  determined 
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bynmnlng  the  flare  model  again  and  again  with  faster  and  faster  rates  for  these  reactions 
until  the  reactions  seemed  to  be  in  equilibrium.  Thus,  the  results  presented  here  are  based 
on  the  assumption  of ’’fast"  chemistry  (Le..  equilibrium)  for  reactions  168. 172.  and  173. 


Table  3.  ESccerpt  from  DATBANK  File 


N 

Reaction 

T 

A 

n 

E 

166 

C 

+02 

=CO 

+0 

61 

l.OE-25 

0.0 

-5000. 

167 

MGF 

+MGF 

=MGF2 

+MG 

18 

3.5E-25 

-5180. 

168 

MG 

+02 

=MGOS 

+0 

15 

l.OE-09 

0.0 

-16500. 

169 

CGR 

+0 

=CO 

48 

3.3E-16 

-0.5 

0. 

170 

CGR 

+OH 

=CO 

+H 

18 

1.8E-16 

-0.5 

0. 

171 

CGR 

+C02 

=CO 

+CO 

18 

1.3E-15 

-1.0 

-41926. 

172 

MGO 

+MGO 

=MGOS 

+MGOS 

11 

l.OE-08 

173 

MGF2 

+MGF2 

=MGFL 

+MGFL 

11 

l.OE-10 

The  numbers  in  column  N  are  the  reaction  numbers.  T  is  a  two  digit  code  which  specifies 
the  reaction  type.  The  documentation  for  SPF  should  be  consulted  for  more  details.® 
The  numbers  A.  n.  and  E  describe  the  variation  of  the  rate  constant  with  temperature 
according  to  the  Arrhenius  equation:  k=AT*  exp{EIRT).  T  is  the  temperature  and  R  is 
the  ideal  gas  constant  The  activation  energy  E  is  in  kcal/mole;  n  is  dimensionless.  The 
units  of  A  vary  according  to  the  order  of  the  reaction,  but  may  be  found  for  any  given 
reaction  from  the  specification  that  the  concentrations  are  in  molecules  per  cubic 
centimeter  and  the  resulting  reaction  rate  is  in  molecules  per  cubic  centimeter  per 
second. 

Next,  the  reactions  for  the  caibon  particles  were  added.  These  reactions  are  numbeis 
169,  170,  and  171  in  Table  3,  and  express  the  reaction  of  the  solid  carbon  with  the  three 
species  which  were  previously  found  to  be  most  Important  The  rates  for  these  reactions 
were  calculated  using  the  surface  reaction  rates  from  Table  2  for  O,  OH,  and  COj  together 
with  Equation  13.  The  diameter  d„  was  assumed  to  be  one  micron,  and  the  distribution 
parameter  5  was  1.1.  The  reaction  rates  input  to  SPF  via  DATBANK  are  in  the  molecule- 


cm-sec  ^stem  of  units,  so  an  appropriate  conversion  factor  was  applied.  The  resulting 
reaction  rate  constants  are  given  in  Table  3  for  reactions  169,  170,  and  171. 
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There  arc  no  rclishle  pactivlfc  3ize  raeasaremenis  for  these  flares.  However,  one 
micron  seems  to  be  a  leascnahle  estimsle  for  the  diameter  of  the  carbon  particles,  and 
thus  the  reaction  rates  In  Table  3  for  the  solid  carbon  should  at  least  be  of  the  right  order 
oi  magnitude.  Furthermore,  when,  the  model  is  applied  to  actual  flares  for  which 
signature  measurements  are  available,  this  assumed  particle  dimeter  might  be  varied  to 
improve  the  fit 

F^ure  1  illustrates  the  variation  of  concentration  as  a  function  of  distance  from  the 
centerline  of  the  plume  (Y).  This  distance  was  made  dimensionless  by  dividing  by  R  (here 
equal  to  0.13  feet).  The  radial  profile  of  Figure  1  is  at  a  station  5  feet  aft  of  the  flare.  The 
calculation  was  made  for  a  flare  which  h  /.s  just  been  deployed.  This  result  was  computed 
using  the  carbon  particle  reaction  rates  from  Table  3.  assuming  a  one  micron  diameter. 

Figure  2  shows  the  temperature  profile  tir  the  plume.  Here,  the  temperature  profile 
is  calculated  for  three  different  scenarios.  First,  the  code  was  run  with  the  carbon  particle 
reactions  omitted.  Next,  the  particles  were  allowed  to  bum.  using  th^‘  reaction  rates  in 
Table  3,  which  assume  a  one  micron  particle  diameter.  The  purpose  was.  to  examine  the 
effect  produced  by  allowing  the  carbon  particles  to  bum.  Then,  the  ci.<le  was  mn  a  third 
time,  ontywith  the  ratesforreactions  169. 170.  and  171  increased  byS  orders  of  magnitude. 
These  greatly  Increased  reaction  rates  were  Intended  to  simulate  fast"  reactions  and 
produce  an  equilibrium  condition.  Thus,  these  calculations  were  ''iittxicled  to  exj/ore  the 
range  of  possibilities  from  no  particle  reactions  at  all  to  fast  reactions. 

The  temperature  profile  in  Figure  2  shows  some  signs  of  oumen.-al  instab/it!  /  {e.g., 
the  kink  In  the  curve  for  no  particle  reactions  at  Y/R  of  0.5).  Tb  f  caiCuJa  dons  wei «  vpeated 
(results  not  shown)  with  a  smaller  step  size,  and  a  changf  ,  rs  no'  ed  in  the  ref  n  from 
Y/R  of  0.4  to  0.7.  Thus,  the  variation  In  the  temperature  profiles  In  this  ;eglor  ,  i  ay  not 
be  significant.  This  area  requires  more  work.  However,  the  :  ^nipcrat  vice  profile.!  iii  Jie  rest 
of  the  plume  do  appear  to  be  correct.  The  solid  carbon  psu'-iclc  reactions  lo  make 

little  difference  In  the  maximum  temperature.  The  only  rf  ai  eficct  of  rcarJ  .oos  seems 
to  be  an  Increase  In  the  width  of  the  profile. 
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F^ure  3  shows  the  variation  of  the  mass  iiraction  of  solid  carbon  with  radius.  With 
no  part  fde  reactions,  the  decrease  in  solid  carbon  density  toward  the  outer  edge  of  the 
plume  is  due  entiiefy^  to  dilution.  When  the  carbon  reactions  are  enabled,  the  effect  is 
dearfy  seen  in  the  form  of  a  decrease  in  the  carbon  concentration.  This  effect  occurs  again 
when  the  reaction  rates  are  increased. 

Figure  4  shows  the  concentration  of  carbon  monoxide.  With  no  carbon  pariide 
reactions,  there  is  no  pathway  for  the  formation  of  carbon  monoxide,  so  the  concentration 
is  exactly  zero  everywhere  in  the  plume.  When  the  reactions  are  enabled,  carbon  monoxide 
is  fonned  where  the  carbon  partide  laden  flare  products  mix  with  the  air. 

Figure  5  shows  a  similar  effect  on  the  concentration  of  carbon  dioxide.  The  peak 
carbon  dioxide  concentration  occurs  at  a  larger  radius  than  the  peak  carbon  monoxide 
concentration.  This  can  be  interpreted  as  the  result  of  a  two  step  oxidation  process.  Under 
very  fuel  rich  conditions,  the  monoxide  is  formed,  which  then  reacts  further  when  it  reaches 
the  outer  portion  of  the  plume  where  oxygen  is  more  plentiful.  Since  most  of  the  heat 
released  by  burning  carbon  is  due  to  the  second  oxidation,  this  would  explain  the  chariges 
seen  in  tlie  temperature  profile  (Figure  2). 

The  most  important  result  is  given  in  Figure  6,  which  shows  the  plume  signature 
computed  for  these  three  different  reaction  scenarios.  The  radiation  intensity  was  spectrally 
integrated  from  2000  to  5000  cm"'  and  spatially  integrated  over  tire  plume.  This  calculation 
was  repeated  at  one  second  Intervals  from  ignition  to  burnout,  resulting  in  a  plot  of  the 
total  emissive  power  versus  time. 

Figure  6  demonstrates  the  effect  of  including  the  carbon  particle  reactions.  Going 
from  no  carbon  particle  reactions  at  all  to  the  fast  reaction  (equilibrium)  limit  makes  an 
order  of  magnitude  difference  in  the  integrated  total  emissive  power  of  the  plume.  Using 
the  computed  reaction  rates  for  one  micron  carbon  particles,  the  answer  falls  between 
these  two  extreme  limits. 
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Condusions 


A  method  has  been  found  which  allows  the  indusion  of  Tiding  partide  effects  in 
existing  computer  programs  which  are  written  to  handle  chem^call^  reacting  gases.  This 
apprcndmation  treats  the  particulate  species  as  pseudo-gases,  and  computes  effective  rates 
for  reactions  involving  these  species  based  on  an  assumed  partide  diameter. 

The  plumes  of  Teflon-magnesium  flares  contain  large  quantities  of  unreacted  carbon 
in  the  form  of  solid  paitides.  Reactions  between  these  solid  tmrbon  partldes  and  the 
gaseous  species  O.  OH.  and  CO2  were  added  to  the  SMIRF  flare  model.  The  results  indicate 
that  these  gas-partlde  reactions  may  make  an  order  of  magnitude  difference  in  the  pre¬ 
dicted  flare  signature. 

The  flare  sigriature  calculated  by  using  the  actual  reaction  rates  for  the  particulate 
reactions  is  clearly  different  from  the  signature  calculated  by  either  Ignoring  these  reac¬ 
tions.  or  by  making  them  so  fast  as  to  be  in  equilibrium.  Thus,  accurate  prediction  of  flare 
signatures  will  require  that  the  kinetics  of  these  reactions  be  modelled.  They  carmot  be 
Ignored,  and  they  carmot  be  assumed  to  be  fast. 
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Plume  Concentration  Profile 
Generic  Flare,  T=0  sec,  X=5  ft 


YA 


Figure  1 .  A  plot  of  the  concentrations  of selected  species  as  a function  of  radial  distancefrom 
the  centerline  of  the  plume.  The flare  has Just  been  released  and  ignited  (t=0  sec),  and  this 
cross  section  of  the  plume  is  at  a  distance  5  feet  downstream  from  the  flare  (x=5  ft).  The 
species  C(gr)  is  graphite,  or  solid  carbon. 
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Plume  Temperature  Profile 
Generic  Flare,  T=0  sec,  X=  5  ft 


Y/R 

Figure  2.  Under  the  same  conditions  as  in  Figure  1  (t~0  sec,  x=5  Ji),  the  radial  temperature 
profile  is  shown.  Three  different  curves  are  shown  corresponding  to  no  reactions  for  carbon 
particles,  reaction  rates  calculated  based  onal  micron  particle  diameter,  and  extremely 
fast  reactions  (equilibrtum). 
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Mass  Fraction 


C(gr)  Concentration  Profile 
Generic  Flare,  T=0  sec,  X=5  ft 


Y/R 


Figure  3.  A  plot  of  the  carbon  particle  density  in  the  plume.  As  before,  this  plot  corresponds 
to  t=0  seconds  and  x=5  feet 
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Mass  Fraction 


CO  Concentration  Profile 
Generic  Flare,  T=0  sec,  X=5  ft 


Figure  4,  A  plot  of  the  concentration  of  carbon  monoxide  in  the  plume.  This  plot  corresponds 
to  t=0  seconds  and  x=5  feet 
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Intensity  (W/sr) 


Figure  6.  A  plot  of  the  intensity  of  infrared  emissions  from  the  flare  as  a  function  of  time. 
The  emissions  have  been  integrated  spatially  and  spectrally  (from  2000  to  5000  cmi^). 
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ABSTRACT 


Mathematical  morphology  is  a  powerful  tool  for  image  aslaysis  and  enhancement.  Morphologi¬ 
cal  operators  are  si  ape-dependent,  nonlinear  image  transforms  such  as  ersosion,  dilation,  opening, 
closing,  and  rank  filters.  The  mathematical  operators  are  defined  in  n  dimensions  so  it  is  possible 
to  create  programs  that  will  operate  on  ID  signals,  2D  images,  or  3D  datasets  using  exactly  the 
same  concepts.  As  operators  on  2D  binary  images,  mathematical  morphology  is  well  known;  much 
software  is  available  which  performs  binary  morphology.  However,  the  true  power  in  morphology 
lies  in  its  ability  to  transform  grayscale  2D  pixel  images  and  3D  voxel  datasets.  Yet  there  is  no 
commercially  available  software  that  performs  these  functions. 

This  is  a  description  of  a  morphological  software  package  written  under  the  auspices  of  the  US 
Airforce  Office  of  Scientific  Research  at  Arnold  Airforce  Base  during  the  1991  Sununer  Faculty 
Research  program.  The  software  includes  a  2D  image  morphology  program,  a  3D  voxel  image 
morphology  program,  a  program  for  enhancement  and  noise  reduction  of  2D  images,  and  related 
support  routines  for  image  arithmetic  and  logical  operations.  These  programs  will  perform  many  -  if 
not  all  -  of  the  possible  morphologial  operations  on  both  binary  and  grayscale  images.  They  provide 
many  of  the  commonly  used  features  automatically  or  permit  the  user  to  customize  the  operation  to 
fit  an  application.  The  user  programs  operate  on  Sun  rasterfiles.  However,  the  main  morphological 
routines  are  c-code  subroutines.  These  are  independent  of  specific  file  formats,  so  a  users  can  easily 
write  an  interface  program  to  operate  on  other  file  formats. 
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I.  INTRODUCTION 


Mathematical  morphology  is  the  name  applied  to  a  collection  of  set  theoretic  operators  defined  on 
a  abstract  stiTicture  known  as  in  infinite  lattice.  These  operators  were  first  examined  ^stematically 
by  Math^on  [6]  and  Serra  [8],  [9],  [10]  in  the  1960’s,  are  an  extension  of  Minkowsky’s  set  theory. 
They  ate  essedally  useful  for  image  analysis  and  image  enhancement.  There  are,  in  the  general 
literature,  a  number  of  good  tutorials  on  image  morphology  [1],  [2],  [3],  [4],  [5],  [9].  We  refer  the 
novice  to  any  of  these  p^ers  for  the  fundamentals. 

Morpholo^cal  onerators  include,  erosion,  dilation,  opening,  closing,  rank  filters  (includes  median 
filter),  tophat  tratsfcrms,  and  generalized  correlations.  These  operations  can  be  defined  on  binary 
or  grayscale  images  in  any  number  of  dimensions.  A  morpholo^cal  operator  is  governed  by  a 
small  pseudo  image  called  a  “structuring  element”  (SE).  When  applied  to  an  image,  the  operator 
returns  a  quantitative  measure  of  the  image’s  geometrical  structure  in  terms  of  the  SE.  This  measure 
can  be  used  to  isolate  features  in  an  image  or  to  construct  a  synthetic  image  containing  regular 
approximations  of  the  features  in  the  original.  A  “regular”  feature,  in  this  sense,  is  a  feature  that 
can  be  reconstructed  uniquely  by  the  operator  using  the  SE. 

Certmn  morphological  operators,  when  applied  to  2D  grayscale  images  or  3D  voxel  images  return 
a  synthetic  image  that  Is  -  in  a  sense  -  a  smoothed  version  of  the  original  image.  But,  the  smoothing  b 
nonlinear.  Unlike  a  linearly  smooth  image,  a  morphologically  smooth  image  usually  contains  sharp 
edges.  The  pieces  of  the  original  image  that  are  not  present  in  the  transformed  image  are  those 
features  that  could  not  be  approximated  by  the  SE.  These  pieces  are  in  the  residual,  the  difference 
between  the  original  image  and  the  smoothed  version.  Both  the  smooth  output  and  the  residual 
can  be  quite  useful.  The  smooth  output  b,  in  a  sense,  a  root  or  core  image.  It  is  simpler  than 
the  original  while  retaining  important  characterbtics.  Thb  can  make  analysis  simpler.  The  residual 
contains  all  “irregular”  features  such  as  noise  and  edges  that  could  not  be  approximated  by  the  SE. 
With  a  few  assumptions  about  the  differences  between  noise  and  edges,  one  can  remove  much  of 
the  noise  from  the  residual  image.  Adding  the  “cleaned-up”  residual  back  to  the  regular  features 
yields  a  relatively  noise  free  image  with  many  edges  intact  and  unblurred.  (Note:  We  have  included 
a  program,  mclean,  in  our  package,  which  does  precisely  that.) 

Mathematical  morphology  extends  to  the  analysis  of  3D  data  sets  in  two  ways.  Morphological 
operators  applied  to  three  dimensional  voxel  images  such  as  the  output  from  a  CT,  MRI  or  PET 
scanner  are  completely  analogous  in  effect  to  those  applied  to  2D  imagery.  A  morphological  operator 
will  approximate  within  specific  3D  shape  constraints,  the  density  or  intensity  of  voxeb  in  space. 
Thus,  nonlinear  smootings  and  residuals  can  be  constructed  for  analysis  or  enhancement  of  a  3D 
image. 

Three  dimensional  morphological  operators  are  quite  useful  in  another  context.  Consider  that 
a  time  sequence  of  2D  images  can  be  thought  of  a  3D  image  in  spacetime  (2  space  eixes,  1  time 
axis).  If  we  treat  time  as  a  spatial  dimension,  we  can  apply  3D  morphology  to  a  time  sequence  of 
images.  The  properties  of  spacetime  can  be  exploited  by  morphological  processing  for  the  analysis 
and  enhancement  of  moving  imagery. 

A  true  object  (not  an  artifact  of  imaging)  will  exist  in  multiple  frames  of  a  time  sequence,  as  long 
as  it  is  not  moving  too  rapidly  through  the  field  of  view.  Noise,  however,  and  other  spurious  artifacts 
of  the  imaging  process  will  change,  often  dramatically,  from  frame  to  frame.  An  appropriate  3D 
morphological  of'crator,  applied  to  such  a  sequence  can  preserve  the  persistant  shapes  and  discard 
those  which  char  ge  rapidly.  This  researcher  has  observed  that  simple  3D  morphological  operations 
applied  to  a  noisey  time  sequence  can  reduce  dramatically  the  apparent  noise. 

Two  dimensional  objects  in  a  time  sequence  of  images  become  3D  generalized  cylinders,  or 
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‘Hronns”  in  3D  spacetime.  The  pitch  or  angle  these  worms  make  with  respect  to  the  time  axis  is 
directly  related  to  the  vdocity  with  which  the  objects  are  moving  in  the  time  sequence.  It  is  a  simple 
matter  to  design  structuring  elements  that  are  sensitive  to  spedhc  velocities  or  ranges  of  velocities 
dnce  it  is  simple  to  specify  the  shape  of  the  SE.  Thus,  it  is  possible  to  use  3D  morphology  to  enhance 
or  filter  out  objects  moving  with  spedfic  vdocites  in  an  image  sequence. 

Mathematical  morphology  is  of  unquestionable  utility  in  the  analysis  and  enhancement  of  very 
noisey  imagery  like  that  processed  by  scientists  and  en^eers  at  Arnold  Engineering  and  Devdop- 
ment  Center.  This  software  will  give  them  the  capacity  to  perform  both  2D  and  3D  morphology  on 
their  imagery. 


II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT 

The  objective  of  this  research  effort  was  to  provide  the  scientists  and  engineers  at  Arnold  Engi¬ 
neering  and  Development  Center  with  robust,  reliable,  and  easy  to  use  software  for  two  and  three 
dimensional  mathematical  morphology.  The  programs  had  to  be 

•  Comprehensive.  It  should  implement  virtually  every  conunonly  used  morphological  oper¬ 
ation.  It  should  operate  on  both  binary  and  grayscale  imagery.  It  should  implement  both 
set  and  function  morphology  and  hit-or-miss  transforms.  It  should  be  able  to  use  structuring 
elements  of  any  size,  shape,  or  connectivity. 

•  Flexible.  It  should  be  easily  extensible  to  add  new  functionalites  or  new  structuring  elements. 
It  should  be  easily  modifiable  to  operate  on  images  in  a  variety  of  formats.  It  should  run  on 
any  Unix  workstation  with  minimal  changes. 

•  Robust.  It  should  be  as  bug-free  as  possible.  It  should  be  predictable;  similar  inputs  under 
similar  operators  should  produce  similar  outputs.  Novel  uses  withinin  reason  should  not  crash 
the  program. 

The  author  feels  confident  that  he  has  met  the  first  objective.  While  it  is  trivial  to  add  new 
structuring  elements,  the  second  objective  is  not  as  straightforward  to  accomplish  in  every  respect. 
It  was  not  possible  to  write  a  specific  user  interface  for  the  program  and  have  it  read  any  image  file 
format.  The  author  got  as  close  as  possible  to  that  ideal  by  writing  the  main  2D  and  3D  morphology 
procedures  as  subroutines,  decoupled  from  any  specific  file  type.  He  also  wrote  user  interface  -  I/O 
programs  that  read  and  write  Sun  rasterfiles  and  pass  data  to  the  subroutines.  Users  wishing  to  use 
other  file  formats  will  either  need  to  use  file  conversion  programs,  like  pbmplus  or  image  alchemy, 
to  convert  their  formats  to  rasterfiles  or  they  will  need  to  write  their  own  file  I/O  program  that 
calls  the  morphological  subroutines.  The  programs  have  been  written  making  no  use  of  SunOS- 
specific  system  calls.  They  do  not  employ  a  graphical  user  interface  -  they  use  a  simple  command 
line  interface.  This  should  ensure  portability  into  most  Unix  environments.  In  fact,  the  programs 
should  run  with  few  changes  on  any  PC  that  has  linearly  addressable  memory  such  as  an  Amiga  or 
Macintosh,  providing  it  has  suffcient  memory. 


m.  2D  IMAGE  MORPHOLOGY  SOFTWARE 


In  this  section  we  will  detail  the  use  of  MorpkSuh,  the  2D  morphology  subroutine,  and  morph, 
its  Sun  rasteriile  implementation. 

MorphSub 

Subroutine  MorpkSub  performs  many  (if  not  all)  possible  2D  morpholo^cal  operations  on  gray- 
level  or  binary  images.  This  includes  hit-or-miss  transforms,  order  statistic  filters,  function  process¬ 
ing  or  set  processing  erosion,  dilation,  opening  and  closing,  tophat  (image  minus  opening),  bothat 
(closing  minus  image),  and  isolated  delete  functions. 

To  use  make  the  follovdng  call  in  a  c  program: 


MorphSub(  MOp,  In,  Out,  Ix,  ly,  ImgType,  KZPad,  LTbresh,  UThresh, 

SEName,  AutoSE,  Sx,  Sy,  Sz,  SEType,  SorF,  Rank,  NoScale,  Dsp) 

where  the  parameters  are  defined  by: 


int 

MOp; 

Morphological  operation  to  perform 

byte 

♦In; 

input  image  as  byte  list  in  row  major  order 

byte 

♦Out; 

output  image  as  byte  list  in  row  major  order 

int 

Ix,Iy; 

image  horizontal,  vertical  dimensions 

int 

ImgType; 

image  type  (gray-level  or  binary) 

int 

NZPad  ; 

flag.  F  =>  zeropadding  of  input 

int 

LThresh; 

lower  binary  threshold  value 

int 

UThresh; 

upper  binary  threshold  value 

char 

♦SEName; 

structuring  element  (SE)  path  name 

int 

.\utoSE; 

use  a  canned  or  auto  generated  SE 

int 

Sx,Sy,Sz; 

SE  X  and  y  support  dims  and  max  gray-lev  (z) 

int 

SEType; 

Binary  SE  or  gray-level  SE 

int 

SorF; 

Set  operation  or  Function  operation 

int 

Rank; 

rank  for  rank  filter 

int 

NoScale; 

flag.  T  do  not  scale  output  of  IntMorph 

int 

Dsp; 

flag.  T  =>  display  some  info 

A  detailed  explanation  of  parameters  follows: 

Mop  Morphological  Operation.  This  integer  parameter  can  have  the  following  values; 


mnemon 

hex 

dec 

action 

ERODE 

O.X0001 

1 

erosion 

DILATE 

0x0002 

2 

dilation 

OPEN 

0x0004 

4 

opening 

CLOSE 

0x0008 

8 

closing 

RANK 

0x0010 

16 

rank  filter  (order  statistic  filter) 

TOPH.AT 

0x0020 

32 

tophat  (image  minus  opening) 
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BOTHAT 

0x0040 

64 

bothat  (closing  minus  image) 

Moreover,  the  erode  and  dilate  functions  can  be  “negated”  using: 

mnemon 

hex 

dec 

action 

NOTFLG 

0x0080 

128 

“not”  flag  for  binary  erodes  and  dilates 

Let  I  =  original  image;  E  =  eroded  image;  D  =  dilated  image.  The  following  are  valid 
parameters: 


mnemon 

hex 

dec 

action 

ERODE  1 

NOTFLG 

0x0081 

129 

I  &&  !E  (pixelwise) 

DILATE 

1  NOTFLG 

0x0082 

130 

D  kb.  !I  (pixelwise) 

ERODE  i 

1  NOTFLG  with  a  binary  hit-or-miss 

structuring  element  will  delete  in  a 

binary  image,  white  features  with  the  shape  of  the  “hit”  portion  of  the  SE.  (e.g.  one 
can  easily  devise  a  SE  to  delete  isolated  white  pixels).  With  a  gray-level  SE  it  will 
delete  the  “interiors”  from  sets  of  white  pixels  in  a  binary  image. 

DILATE  1  NOTFLG  with  a  binary  hit-or-miss  structuring  element  will  delete  in  a 
binary  image,  black  features  with  the  shape  of  the  “hit”  portion  of  the  SE.  (e.g.  one 
can  easily  devise  a  SE  to  delete  isolated  black  pixels).  With  a  gray-level  SE  it  will 
delete  the  “interiors”  from  sets  of  black  pixels  in  a  binary  image. 

In  The  input  image.  A  pointer  to  an  Ix-times-Iy  long  list  of  bytes.  The  byte  list  is  a 
one-byte  per  pixel  grayscale  image  that  is  ly  rows  by  Ix  columns  in  row  major  order. 
(First  row  of  pixels,  followed  by  second  row,  followed  by  third,  . . .). 

Out  The  output  image.  A  pointer  to  an  Ix-times-Iy  long  list  of  bytes.  The  byte  list  is  a 
one-byte  per  pixel  grayscale  image  that  is  ly  rows  by  Ix  columns  in  row  major  order. 
This  is  where  morph  will  write  its  output.  The  space  must  be  allocated  by  the  calling 
program. 

Out  and  In  may  point  to  the  same  space  without  error.  However,  morph  will  then 
overwrite  the  input  image  with  the  output. 

Ix  Image  horizontal  dimension.  Number  of  pixels  per  line  (or  number  of  columns  per 
row)  in  image. 

ly  Image  vertical  dimension.  Number  of  lines  (or  rows)  in  the  image, 

ImgType  Imag®  type:  This  specifies  whether  the  input  image  is  to  be  treated  like  a  gray  level  or 

binary  image.  The  values  in  a  binary  image  are  zero  and  not-zero  (it  does  not  matter 
which  of  the  values  in  [BLACK,WHITE]). 

Parameter  values: 
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NZPad 


LThresh 

UThresh 


SEName 


AutoSE 


mnemon 

hex 

dec 

meaning 

GRAIMG 

0x0000 

0 

gray-level  image 

BINIMG 

00x0200 

512 

binary  image 

If  zero  (FALSE)  MorpkSub  will  extend  the  size  of  the  input  image  internally  and  create 
a  border  of  zeros  around  it.  If  nonzero  (TRUE)  MorphSnb  will  not  do  the  zero  padding. 
Note:  this  is  all  internal  to  AforphSub.  The  input  and  output  images  are  both  Ix  by 
ly  independent  of  this  flag. 

There  are  image  border  effects  with  each  option.  The  effects  differ  depending  on  the 
option.  With  the  option  FALSE,  morph-sub  does  what  it  can  to  “color  in'*  the  image 
netir  its  perimeter.  With  this  option  TRUE,  the  output  image  has  a  border  of  zeros 
inside  it  the  width  and  height  of  the  SE.  That  is,  the  transformed  area  of  the  output 
image  is  smaller  than  the  actual  image  dimensions.  This  is,  in  a  sense,  a  more  accurate 
result  than  the  zero  padded  default.  To  zero  pad  the  input  permits  the  program  to 
transform  the  border  region,  hut  it  does  this  on  the  assumption  that  the  original  scene 
was  black  outside  the  image.  This,  of  course,  is  almost  never  true.  Thus,  the  border 
region  is  inaccurately  transformed.  Use  this  sv/itch  if  accuracy  is  more  import2mt  than 
having  an  image  that  is  “colored  in”  out  to  the  boundary. 

lower  binary  threshold. 

upper  binary  threshold. 

These  two  parameters  will  extract  a  binary  image  from  a  grayscale  image  via  thresh¬ 
olding  if:  UThresh  is  non  zero  AND  SorF  is  SET.  Then  all  pbcels  with  grey-levels  in 
[Lthresh,UThresh]  will  be  set  to  WHITE,  all  others  to  BLACK.  Thresholding  occurs 
before  any  morphology.  This  makes  it  possible  to  specify  binary  morphology  on  a 
graylevel  image.  If  UThresh  >  0  AND  LThresh  >  Uthresh,  MorphSub  aborts  with 
an  error.  If  UThresh  is  zero  no  thresholding  is  performed.  Note  that  one  can  use 
MorphSub  to  perform  a  simple  threshold  on  an  image  by  specifying  MOp  =  ERODE, 
ImgType  =  GRAIMG,  AutoSE  =  AUTO  Sx  =  1,  Sy  =  1,  Sz  =  0,  and  the  appropriate 
LThresh  and  UThresh. 

This  is  a  string  pointer  to  the  file  (or  path)  name  of  a  structuring  element  (SE)  file.  If 
this  pointer  is  not  NULL  and  AutoSE==0,  then  SEName  is  catenated  to  the  end  of 
the  value  of  environment  variable  SEPATH  to  get  the  SE  file  pathname.  (If  SEPATH 
is  not  defined,  then  SEName  is  used  alone.)  See  below  for  more  complete  info  on  SE’s. 

If  this  flag  is  non  zero,  it  signifies:  use  a  canned  or  auto  generated  SE.  Its  possible 


values  are: 

mnemon 

dec 

meaning 

ZERO 

0 

do  not  use  auto  SE 

AUTO 

1 

generate  disk  shaped  SE 

PLUS 

2 

use  3  by  3  “-f”  shaped  SE 

S3X3 

3 

use  3  by  3  square  SE 

S5X5 

4 

use  5  by  5  quasi  disk  SE 
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Sx,Sy 

Sz 

SETVpe 

SorF 


See  below  for  more  complete  info  on  SE’s. 

Horizontal  emd  vertical  dimensions  of  program  generated  SE.  Ignored  if  AutoSE  != 
AUTO. 

Gray  level  of  origin  pixel  in  program  generated  SE.  This  is  used  if  AutoSE  ==  AUTO. 
Ignored  if  AutoSE  !=  AUTO. 

specifies  whether  the  structuring  element  is  to  be  interpreted  as  a  gray-level  SE  or  a 
binary  SE.  (See  below  for  the  distinction.)  Its  possible  values  are: 


nmemon 

hex 

dec 

meaning 

GRASE 

0x0000 

0 

gray-level  SE 

BINSE 

0x0100 

256 

binary  SE 

Specifies  whether  the  morphological  operation  is  to  be  of  the  “set”  type  or  “function” 
type.  (See  below  for  the  distinction.)  Its  possible  values  -.e: 


mnemon 

hex 

dec 

meaning 

SET 

0x0000 

0 

set  operation 

FUNCT 

0x0400 

1024 

function  operation 

Structuring  element  specifications: 

SEFile:  A  structuring  element  file  is  5n  ASCII  file  of  integers  separated  by  spaces. 
The  first  two  numbers,  x,  y,  are  the  horizontal  and  vertical  dimensions  in  pixels  of  the 
smallest  rectangle  that  will  cover  the  structuring  element.  Both  x  and  Y  must  be  >  0. 
The  next  two  numbers,  i,  j  are  the  horizontal  and  vertical  coordinates,  respectively,  of 
the  SE  origin.  IMPORTANT:  The  origin  is  expected  to  be  in  the  covering  rectangle. 
If  not,  MorphSub  aborts.  The  upper  left  hand  corner  of  the  rectangle  has  coordinates 
(0,0);  the  lower  right  is  (i  —  1,  y—  1).  Following  the  first  four  integers  are  x  x  y  integers 
separated  by  spaces.  These  numbers  are  the  SE  elements.  Their  interpretation  depends 
on  the  morphologic^  operation  being  performed. 

Negative  SE  elements  are  ALWAYS  treated  as  logical  DON’T  CAREs.  That  is,  when 
the  operation  is  in  progress,  image  pixels  under  negative  SE  elements  are  ignored. 
Thus,  the  support  of  the  SE  is  limited  to  those  elments  that  are  nonnegative.  This 
permits  the  creation  of  odd-shaped  and  multiply  connected  SE’s  or  the  placement  of 
the  SE  origin  outside  the  body  of  the  SE.  If  ImgType  is  binary,  (i.e.  pixels  grouped  as 
zero  and  not  zero),  and  if  SEType  is  binary,  then  the  SE  is  used  to  perform  a  hit-or- 
miss  transform.  In  this  case,  zero  SE  elements  cover  the  “miss”  support  and  positive 
(nonzero)  elements  cover  the  “hit”  support.  The  actual  gray-levels  are  ignored. 

If  ImgType  is  binary,  and  SEType  is  gray  then  the  nonnegative  (both  zero  and  greater 
than  zero)  SE  elements  determine  the  support  of  a  “hit-only”  transform.  That  is,  the 
nonnegative  suport  is  used  as  a  standard  set-type  SE  for  set  (binary)  morphology.  (Of 
course,  the  other  gray-level  info  is  ignored.)  Note:  if  ImgType  is  binary,  then  a  set 
operation  is  performed  by  default  (SorF  is  ignored). 
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Rank 


NcScale 


The  interpretation  of  the  SE  elements  for  ImgType  gray  depends  on  the  flags  SEType 


and  SorF: 

SEType 

SorF 

action 

binary 

set 

Function-set  morphology  on  support  of  strictly 
greater  than  zero  SE  elements. 

binary 

funct 

Same  as  above. 

gray 

set 

Rinction-set  morphology  on  support  of  nonnegative 
(greater  than  or  equal  to  zero)  SE  elements. 

gray 

funct 

Function-function  morphology  on  support  of 

nonnegative  SE  elements. 


AUTO:  the  program  makes  an  SE.  The  self-made  SE  is  a  disk  with  a  diameter  of  Sx 
pixels  horizont2illy  and  Sy  pixels  vertically.  Sx  and  Sy  must  be  odd  and  greater  than 
or  equal  to  1.  If  AutoSE  ==  AUTO,  SEType  is  set  to  gray  (SEType  ==  GRASE). 
Sz  is  the  gray  level  of  the  center  pixel.  If  Sz  >  0:  the  SE  will  have  a  “curved”  top. 
(Use  Sz  >  0  for  a  rolling  bdl  transform.)  A  function  operation  (SorF  ==  FUNCT)  is 
performed.  If  Sz  ==  0,  the  SE  is  fiat-topped.  A  set  operation  is  performed  (SorF  == 
SET). 

S3X3:  specifies  the  SE  to.be  a  3  by  3  square  of  pixels. 

PLUS:  specifies  the  SE  to  be  a  3  by  3  “+”  shaped  set  of  pixels. 

S5X5:  specifies  the  SE  to  be  a  5x5  quasi  disk 
(square  without  corners)  of  pixels. 


If  one  of  the  canned  SE’s  (3x3,  plus,  or  5x5)  is  chosen,  then  SorF  is  set  to  SET  and 
SEType  is  set  to  GRASE. 

Note  that  when  AutoSE  !=  0,  the  values  of  SEType  and  SorF  are  ignored. 

This  is  meaningful  only  if  Mop  ==  RANK.  Then  the  parameter  indicates  the  order 
of  the  filter.  The  rank  option  of  MorphSub  is  actually  an  order  statistic  filter.  To 
compute  the  order,  MorphSub  counts  the  number  of  pixels  in  the  support  of  the  SE. 
In  a  gray-level  SE  that  is  all  values  greater  than  or  equal  to  zero.  We  did  not  define  a 
hit-or-miss  rank  filter.  Therefore,  to  compute  the  support  of  a  binary  SE,  MorphSub 
changes  the  zeros  in  a  binary  SE  into  -I’s  (DON’T  CARES).  Then  all  the  rank  routine 
considers  the  support  to  be  all  SE  elements  >  ZERO. 

If  Rank  ==  0,  MorphSub  applies  a  median  filter.  If  Rank  ==  1  it  does  a  dilation.  If 
Rank  ==  support  of  SE,  then  MorphSub  does  an  erosion.  Otherwise,  MorphSub  does 
an  order  statistic  filter  of  order  Rank. 

When  a  funclicn  processing  operation  is  performed  a  gray-level  image  with  a  gray- 
level  SE,  the  operation  is  performed  with  16-bit-precision  signed  arithmetic.  When 
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the  operation  is  through,  if  NoScale  ==  0,  MorphSub  scales  the  result  to  fit  into  one 
byte  per  pixel.  If  NoScale  !=  0,  morph  limits  the  results  at  BLACK  and  WHITE. 

Scaling  is  performed  as  follows:  if  the  range  of  the  result  is  less  than  one  byte  (i.e. 
MaxPixVal  —  MinPLxVal  <  or  =  256)  the  result  is  translated  so  that  the  minimum 
is  zero.  (i.e.  each  resultant  pixel  is  replaced  with  pixel  —  MinPixVal)  Otherwise,  the 
minimum  is  subtracted  from  each  pixel  and  the  the  difference  is  scaled  by  the  ratio 
WHITE/(MaxPixVal-MinPixVal). 

Dsp  If  this  is  true  (nonzero)  some  info  about  what  is  going  on  is  displayed, 
morph 

The  user  program  morph  takes  paramters  from  the  user  on  the  command  line  and  applies  routine 
MorphSub  to  Sun  rasterfiles.  A  brief  description  of  morph  foillows.  This  description  borrows  heavily 
on  the  description  of  MorphSub  above. 

usage: 

morph  <  In  >  Out  -m  e|d|o|c|r|t|b|p|q  [-i  g|b]  [-s  g|b]  [-o  s|f]  [-t  nnn  [mmm]] 

[-r  med|nnn]  [-z]  [-n]  [-v]  -k  SEFile  |  3x3  |  5x5  |  plus  |  auto  x  y  [z] 


where: 

In  is  the  path  name  of  the  Sun  rasterfile  input  image.  (Type  jin  so  the  file  is  read  in 
through  stdin.) 

Out  is  the  pathname  of  the  Sun  rasterfile  output  image.  (Type  iOut  so  the  file  is  output 
through  stdout.) 

morph  extracts  and  operates  on  the  luminance  component  of  the  image.  Thus  the 
output  of  morph  is  grayscale  even  if  the  input  is  color. 

-m  The  letter  following  -m  indicates  the  morphological  operation: 

e  -  erode 
d  -  dilate 
o  -  open 
c  -  close 
r  -  rank  filter 

t  -  top  hat  transform  (image  minus  opening) 
b  -  bot  hat  transform  (closing  minus  image) 
p  -  1  &&  !E  (pixelwise)  where  I  =  original  image;  E  =  eroded  image; 
q  -  D  &&  !1  (pixelwise)  D  =  dilated  image; 


one  of  these  letters  must  be  specified;  there  is  no  default. 

p  with  a  binary  hit-or-miss  structuring  element  will  delete  in  a  binary  image,  white 
features  with  the  shape  of  the  “hit”  portion  of  the  SE.  (e.g.  one  can  easily  devise  a 
SE  to  delete  isolated  pixels). 
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p  with  a  gray-level  SE  will  delete  the  “interiors”  from  sets  of  white  pixels  in  a  binary 
image. 

q  with  a  binary  hit-or-miss  structuring  element  will  delete  in  a  binary  image,  black 
features  with  the  shape  of  the  “hit”  portion  of  the  SE.  (e.g.  one  can  easily  devise  a 
SE  to  delete  isolated  pixels). 

q  with  a  gray-level  SE  will  delete  the  “interiors”  from  sets  of  black  pixels  in  a  binary 
image. 

-i  Switch  -i  indicates  that  the  next  letter  Sells  the  image  type:  either  g  for  a  gray-level 
image  or  b  for  a  binary  image.  If  -i  is  not  included,  the  default  is  gray-level. 

-s  Switch  -s  indicates  that  the  next  letter  tells  the  structuring  element  type:  either  b  for 
a  binary  SE  or  g  for  a  gray-level  SE.  If  -s  is  not  included,  the  default  is  binary. 

-o  The  letter  following  -o,  either  s  or  f,  indicates  that  the  operation  is  either  a  set  operation 
or  a  function  operation.  (See  reference.)  If  -o  is  not  included,  the  default  is  set  op. 

-t  -t  nnn  [mmm]  indicates  that  a  threshold  of  value  nnn  from  below  (and  mmm  from 
above;  if  unspecified  mmm  ==  255)  will  be  used  on  the  input  if  the  following  2 
criteria  are  true:  the  input  is  a  gray-level  image  AND  the  operation  is  a  set  operation. 
If  the  two  criteria  are  true  and  -t  nnn  [mmm]  is  not  included,  the  operation  is  treated 
as  a  function  and  set  processing  (FSP)  operation  (See  ref.).  If  the  criteria  are  not  true 
and  -t  nnn  is  specified  anyway,  it  is  ignored.  Note  that  you  can  do  a  simple  threshold 
of  a  gray  level  image  with: 

morph  <  in.ras  >  out.ras  -m  e  -t  128  -k  auto  1  1 

-r  Switch  -r  is  meaningful  only  if  -m  r  is  specified.  Then  the  field  following  -r  indicates 
the  order  of  the  filter.  If  the  letters  “med”  are  in  the  field,  a  median  filter  is  used.  If 
the  field  contains  a  number,  nnn,  then  that  value  is  used.  If  -op  r  is  specified  and  -r 
med|nnn  is  not,  the  rank  filter  defaults  to  a  median  filter. 

-z  The  presence  of  switch  -z  tells  the  program  NOT  to  zero-pad  the  boundary  of  the 
image. 

-n  Switch  -n  tells  the  program  NOT  to  scale  the  output  of  the  operation.  Such  scaling 
happens  by  default  for  a  function  operation  (-o  f)  on  a  gray-scale  image  (-i  g)  with  a 
gray-scale  SE  (-s  g).  This  switch  is  ignored  by  other  operations. 

-V  Switch  -V  tells  the  program  to  display  some  info  as  it  computes. 
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-k  Switch  -k  indicates  that  the  next  field  is  one  of  5  things: 


3x3  This  selects  S3X3  in  routine  MorphSub. 
plus  This  selects  PLUS  in  routine  MorphSub. 
5x5  This  selects  S5X5  in  routine  MorphSub. 
auto  X  y[z\-  the  program  makes  an  SE. 


The  self-made  SE  created  when  -k  auto  is  specifeid  is  a  disk  with  support  covering  x 
pixels  horizontally  and  y  pixels  vertically,  x  and  y  must  be  odd.  If  specified,  z  is  the 
gray  level  of  the  center  pixel.  If  ziO,  a  function  operation  (-o  f)  is  performed.  If  z  is 
not  given  or  z  =  0,  the  level  defaults  to  BLACK  (0)  and  a  set  operation  is  performed 
(-0  s). 

If  the  field  following  -k  is  not  one  of  the  above,  then  the  string,  called  SEFILE  in  the 
usage  example,  is  taken  as  the  pathname  f  a  structuring  element  file.  If  the  user  has 
an  environment  variable  called  SEPATH,  the  program  appends  SEFILE  to  it  for  the 
complete  pathname. 

If  one  of  the  canned  SB’s  (3x3,  plus,  or  5x5)  is  chosen,  the  -s  and  -o  flags  are  forced 
to  be  -s  g  -o  s. 

If  “-k  auto”  is  specified  the  SE  type  is  forced  to  gray  (-s  g). 


Included  with  morph  are  Unix  c-shell  scripts  that  set  up  morph  to  run  some  of  the  more  common 
morphological  operations,  these  include: 


bclose 

bdilate 

berode 

bopen 

brank 

cleanup 

gfclose 

gfdilate 

gferode 

gfopen 

grank 

gsclose 

gsdilate 

gserode 

gsopen 

hitormiss 

invroll 

invtop 

medge 

rollball 

thresh 

tophat 


binary  closing 
binary  dilation 
binary  erosion 
binary  opening 
binary  rank  filter 

example  of  image  noise  reduction  using  rolling  ball 

gray-level  function-function  closing 

gray-level  function-function  dilation 

gray-level  function-function  erosion 

gray-level  function-function  opening 

gray-level  rank  filter 

gray-level  function-set  closing 

gray-level  function-set  dilation 

gray-level  function-set  erosion 

gray-level  function-set  opening 

binary  hit-or-miss  transform 

inverted  rolling  ball  transform 

inverted  tophat  transform  (bothat) 

morphological  edge  detection 

rolling  ball  transform 

threshold  a  gray-level  image 

tophat  transform 


mclean 


Program  mclean  is  an  image  enhancement  /  noise  reduction  program  that  uses  2-dimensional 
gray-scale  mathematical  morphology.  The  program  cleans  up  any  image  with  noise  in  its  luminance 
component  such  that  the  noise  has  a  smaller  variance  than  the  most  important  luminance  edges. 

In  its  default  configuration,  the  program  computes  a  smoothed  version  (S)  of  the  luminance 
component  (L)  of  the  original  image  (I)  by  taking  a  pointwise  average  of  the  opening  (O)  and  the 
closing  (C)  of  L  with  a  user-specified  structuring  element  (SE).  It  subtracts  S  from  L  to  create  a 
residual  image  (R).  The  positive  pixels  in  R  form  an  image  called  the  “tophat”  (T)  and  the  negative 
pixels  form  an  image  called  the  “bothat”  (B).  T  contains  bright  image  features  that  could  not  be 
created  with  the  SE.  (Recall  that  0  is  the  best  approximation  of  L  than  can  be  created  by  overlaying 
SE’s  subject  to  the  constraint  that  L  is  pointwise  everywhere  GREATER  than  or  equal  to  O.)  T, 
then,  contains  noise  as  well  lines,  spots,  and  other  thin  or  small  (with  respect  to  the  SE),  bright 
image  features.  B  contains  similar  types  of  features,  but  dark  rather  than  light.  (Recall  that  C  is 
the  best  approximation  of  L  than  can  be  created  by  overlaying  SE’s  subject  to  the  constraint  that 
L  is  pointwise  everywhere  LESS  than  or  equal  to  B.) 

If  the  gray  level  variance  of  the  noise  in  L  is  less  than  that  of  important  luminance  features  such 
as  lines  and  highlights,  the  noise  can  be  filtered  from  R  by  inverse  center  clipping.  That  is,  create  a 
thresholded  residual  image  (RT)  by  replacing  with  zero  the  value  of  any  pixel  in  R  whose  magnitude 
is  lower  than  some  threshold.  In  general,  the  upper  threshold,  u  (the  threshold  for  T),  is  different 
in  magnitude  from  the  the  lower  threshold,  I  (the  threshold  for  B).  Threshold  u  is  nonnegative  and 
1  is  nonpositive.  The  important  features  in  R  remain  in  RT  along  with  some  isolated  very  bright  or 
very  dark  noise  points  (isolated  single  pixels).  Define  the  support  of  RT  to  be  the  set  of  all  nonzero 
pixels  in  RT.  Then,  the  noise  points  are  removed  by  perfoming  an  isolated  delete  on  the  support  of 
RT. 

RT  is  recombined  with  S  as  follows:  Let  TT  be  the  thresholded  tophat  part  of  RT,  and  let  BT  be 
the  thresholded  bothat  part  of  RT.  Subtract  u  pixelwise  from  the  TT  and  add  the  result  (pixelwise) 
back  to  the  S.  Add  1  pixelwise  to  BT  and  subtract  the  result(pixelwise)  from  S.  This  procedure  puts 
the  important  original  features  back  in  L  without  the  noise. 

Program  mclean  operates  on  Sun  rasterfiles.  It  automatically  extracts  the  luminance  portion  of 
the  image  if  the  rasterfile  has  a  colormap.  The  output  of  mclean  is  a  grayscale  rasterfile.  To  recreate 
a  color  image  requires  extracting  the  hue  and  saturation  components  from  the  original  image  and 
recombining  these  with  the  cleaned  up  luminance.  The  author  has  not  tried  morphological  processing 
on  the  H  and  S  components.  Also,  to  remake  LHS  image  back  into  an  8-bit  color  (lookup-table 
indexed)  image  requires  use  of  a  color  quantization  scheme  such  as  Heckbert’s  median  cut.  It  is  not 
known  what  noise  this  will  reintroduce  into  the  result. 

usage: 

mclean  InFile  OutFile  [-s  g|b]  [-o  s|f]  [-t|b]  [-1  nnn]  [-u  nnn]  [-f  nn.nn]  [-r  nn.nn]  [-h]  [-a  OCFile] 

-k  SEFile  |  3x3  |  5x5  j  plus  |  auto  xxx  yyy  [zzz] 


where: 

InFile  is  the  path  name  of  the  Sun  rasterfile  input  image. 

OutFile  is  the  pathname  of  the  Sun  rasterfile  output  image. 

mclean  extracts  and  operates  on  the  luminance  componei)*  of  the  image.  Thus  the 
output  of  mclean  is  grayscale  even  if  the  input  is  color. 
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-s  Switch  -s  indicates  that  the  nest  letter  teUs  the  structuring  dement  t>-pe:  dther  b  for 
a  binary  SE  or  g  for  a  gray-Ievd  SE.  If  -s  is  not  included,  the  default  is  binary. 

-o  The  letter  following-o,  dther  sor  f,  indicates  that  the  operation  is  dther  aset  operation 

or  a  function  operation.  If  -o  is  not  included,  the  default  is  set  op.  Sdecting  f  along 
with  a  not-flat-topped  SE  will  more  dosdy  ^proximate  the  oripnal  image  However, 
it  also  seems  to  make  the  noise  reduction  less  effective. 

-t  The  presence  of  -t  on  the  command  line  tdls  mclatn  to  do  a  tophat  only  process 
(i.e.  don’t  use  the  bothat).  This  can  be  more  effective  Than  the  general  top/bot 
^proach  if  it  appears  that  most  of  the  image  noise  and  features  are  lighter  than  thdr 
surroundings.  Similarly,  -b  means  do  a  bothat  only.  This  can  be  more  effective  than 
the  combo  if  most  of  the  nose  and  features  are  darker  than  thdr  backgrounds. 

-I  Switches  -1  and  -u  cause  mdean  to  use  the  numbers  following  as  lower  <uid  upper 
thresholds,  respectively.  These  override  mclean’s  automatically  determined  bothat 
and  tophat  thresholds. 

-f  Switch  -f  precedes  a  floating  point  nuber  that  is  used  as  a  scale  factor  to  adjust  both 
the  lower  and  upper  thresholds.  If  it  seems  that  mclean  has  been  too  extreme  in  its 
thresholding,  use  -f  0.nn  to  adjust  the  threshold  down.  If,  on  the  other  hand,  too  much 
noise  appears  to  remain  in  the  output,  use  -f  l.nn.  You  could  actually  go  higher  than 
1.99  but  but  if  the  scale  factor  is  too  large,  you  will  get  back  the  original  image.  When 
not  selected,  the  scale  factor  defaults  to  1.0. 

-r  The  floating  point  number  following  -r  tells  mclean  to  amplify  the  thresholded  resid¬ 
ual,  RT,  before  recombining  it  with  the  the  smoothed  image,  S.  This  exagerates  the 
intensity  of  the  remaining  features.  If  the  number  is  more  than  about  1.05,  the  result 
looks  artiflcial  This  defaults  to  1.  When  not  selected,  RT  is  not  amplifled. 

-h  Flag  -h  tells  mclean  to  use  an  alternate  method  to  select  the  lower  and  upper  thresh¬ 
olds.  When  -h  is  NOT  present  (the  default)  mclean  computes  the  thresholds  as  follows; 
Compute  a  graylevel  histograms  ht  and  hb  from  the  tophat  image  T  and  bothat  im¬ 
age  B,  respectively,  ht  is  deflned  over  nonnegative  values  only  and  hb  is  defined  over 
nonpositive  values  only,  mclean  takes  as  lower  threshold,  1,  the  second  moment  of  hb. 
Similarly,  u  is  the  second  moment  of  ht.  When  -h  IS  present,  mclean  creates  one  his¬ 
togram,  h  defined  over  all  values.  It  chooses  upper  and  lower  thresholds  as  the  value 
one  standard  deviation  above  and  below  the  mean,  respectively.  Generally  the  default 
procedure  does  a  better  job  because  in  most  images  there  is  an  unequal  distribution 
of  light  and  dark  features  (and  noise). 

-a  Switch  -a  tells  mclean  to  write  out  the  smoothed  image  to  a  file  where  OCfile  repre¬ 
sents  the  user-suplied  file  name.  It  can  be  quite  useful  to  h?.ve  this,  sometimes,  for 
experimentation. 

-k  Switch  -k  indicates  that  the  next  field  defines  the  morphological  stucturing  element 
(SE)  in  exactly  the  same  way  as  for  program  morph. 
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IV.  3D  BIAGE  MORPHOLOGY  SOFTWARE 


This  section  details  MoTpb3DS*h,  the  3D  morphology  subroutine,  and  and  morphSd  the  user 
program  for  3D  morphology  that  operates  on  sequences  of  Sun  rasteriiles. 

MorphSDSub 

Subroutine  MorphSOSub  performs  many  (if  not  all)  possible  3D  morphological  operations  on 
gray-level  or  binary  image  time  sequences  or  3D  voxel  images.  These  include  hit-or-miss  transforms, 
order  statistic  filters,  function  processing  or  set  processing  erosion,  dilation,  opening  and  closing, 
tophat  (image  minus  opening),  bothat  (closing  minus  image),  and  isolated  delete  functions. 

To  use  make  the  following  call  in  a  c  program: 


MorphSub(  MOp,  Inputimage,  Instruct,  Outputimage,  OutStruct,  Ix,  ly,  ImgType,  NZPad,LTbresh,  UThre 
SE,  AutoSE,  sX,  sY,  sZ,  sV,  sorgx,  sorgy,  sorgz,  SEType,  SorF,  Rank  ) 


where  the  parameters  are  defined  by: 


int  MOp; 

int  (*lnputlmage)(); 

int  *  Instruct; 

int  (*OutputImage)(); 

int  *  OutStruct; 

int  Ix,  ly; 

int  ImgTyps; 

int  NZPad ; 

int  LThresh; 

int  UThresh; 

int  ♦SE; 

int  sX,sY,8Z,sV; 

int  sorgx, sorgy, sorgz; 

int  SEType; 

int  SorF; 

int  Rank; 


Morphological  operation  to  perform 

address  of  image  input  function 

address  of  structure  to  pass  to  Inputimage 

address  of  image  output  function 

address  of  structure  to  pass  to  Outputimage 

image  horizontal,  vertical  dimensions 

image  type  (gray-level  or  binary) 

flag.  F  zeropadding  of  input 

lower  binary  threshold  value 

upper  binary  threshold  value 

SE  in  plane-row-major  order 

SE  X,  y,  and  z  support  dims  and  max  gray-lev  (v) 

SE  origin  coordinates 

Binary  SE  or  gray-level  SE 

Set  operation  or  Function  operation 

rank  for  rank  filter 


A  detailed  explanation  of  parameters  follows: 

Mop  Morphological  Operation.  This  integer  parameter  can  have  the  following  values: 


mnemon 

hex 

dec 

action 

ERODE 

0x0001 

1 

erosion 

DILATE 

0x0002 

2 

dilation 

OPEN 

0x0004 

4 

opening 

CLOSE 

0x0008 

8 

closing 

RANK 

0x0010 

16 

rank  filter  (order  statistic  filter) 
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MAXMIN  0x0020  32 
MINMAX  0x0040  64 


max  of  minima  from  nbhd.  in  each  image 
min  of  maxima  from  nbhd.  in  each  image 


Moreover,  the  erode  and  dilate  functions  can  be  “negated”  using; 


nmemon  hex 

dec 

action 

NOTFLG  0x0080 

128 

“not”  flag  for  binary  erodes  and  dilates 

Let  I  =  original  image;  E  =  eroded 
parameters: 

image;  D  = 

dilated  image.  The  following 

mnemon 

hex 

dec 

action 

ERODE  I  NOTFLG 

0x0081 

129 

I  !E  (pixelwise) 

DILATE  I  NOTFLG 

0x0082 

130 

D  ilk.  !I  (pixelwise) 

ERODE  I  NOTFLG  with  a  binary  hit-or-miss  structuring  element  will  delete  in  a  binary 
image  sequence,  white  features  with  the  shape  of  the  “hit”  portion  of  the  SE.  (e.g.  one 
can  easily  devise  a  SE  to  delete  isolated  white  pixels).  With  a  gray-level  SE  it  will  delete 
the  “interiors”  from  sets  of  white  pixels  in  a  binary  sequence. 

DILATE  I  NOTFLG  with  a  binary  hit-or-miss  structuring  element  will  delete  in  a  binary 
image  seq.,  black  features  with  the  shape  of  the  “hit”  portion  of  the  SE.  (e.g.  one  can 
easily  devise  a  SE  to  delete  isolated  black  pixels).  With  a  gray-level  SE  it  will  delete  the 
“interiors”  from  sets  of  black  pixels  in  a  binary  image  sequence. 

Each  of  these  operations  occurs  over  a  3D  data  set  within  a  3D  neighborhood  (nbhd) 
defined  by  the  3D  shape  of  the  structuring  element  (SE).  The  definitions  are  exactly  the 
same  as  their  2D  counterparts.  (Mathematical  morphology  is  defined  in  n  dimesions.) 
The  exceptions  are  maxmin  and  minmax  which  have  no  counterparts  in  2D.  These  op¬ 
erators  were  designed  for  use  with  time  sequences.  Consider  the  3D  SE  to  be  a  stack 
of  z  2D  SE’s.  The  2D  SE’s  trace  nbhds  in  adjacent  sequential  images.  Maxmin  finds 
the  min  in  each  2D  nbhd  and  takes  the  maximum  of  these.  Minmax  takes  the  smallest 
maximum  from  among  the  2D  nbhds.  The  idea  behind  minmax  is  this:  Applied  to  a  time 
sequence,  a  3D  structuring  element  demarcates  an  area  (a  nbhd)  in  successive  images. 
(The  nbhd  in  an  image  may  or  may  not  be  the  same  the  nbhds  in  succesive  images.)  If 
a  bright,  moving  particle  stays  within  the  successive  nbhds  the  result  of  the  minmax  will 
be  a  bright  pixel.  If,  on  the  other  hand,  the  particle  moves  out  of  one  of  the  nbhds,  the 
result  will  be  a  darker  pixel.  Thus,  when  used  with  minmax,  an  appropriately  shaped 
SE  can  act  as  a  velocity  filter.  Maxmin  is  the  same  with  with  respect  to  dark  particles. 

I/O  Inputlmage(),  Instruct,  OutputImage(),  OutStruct 

These  are  pointers  to  user  supplied  lO  routines  and  data  structures  that  Morph3DSub 
uses  to  read  and  write  images.  These  are  described  below. 
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Ix  Image  horizontal  dimension.  Number  of  pixels  per  line  (or  number  of  colunms  per  row) 
in  an  image  &om  the  sequence. 

ly  Image  vertical  dimension.  Number  of  lines  (or  rows)  in  an  image. 

All  images  in  the  input  sequence  must  have  the  same  xy  dimensions. 

ImgType  Image  type:  This  specifies  whether  the  input  image  sequence  is  to  be  treated  like  a  gray 
level  or  binary  sequnce.  The  values  in  a  binary  image  are  zero  and  not-zero  (it  does  not 
matter  which  of  the  values  in  [BLACK, WHITE]).  Parameter  values: 


mnemon 

hex 

dec 

action 

GRAIMG 

0x0000 

0 

gray-level  image  sequence 

BINIMG 

0x0200 

512 

binary  image  sequence 

NZPad  If  zero  (FALSE)  MorphSDSub  will  extend  the  size  of  each  input  image  internally  and 
create  a  border  of  zeros  around  it  in  the  z  and  y  dimensions.  If  nonzero  (TRUE) 
MorphSDSub  will  not  do  the  zero  padding. 

There  are  image  border  effects  with  each  NZPAD  option.  The  effects  differ  depending 
on  the  option.  With  the  option  FALSE,  MorphSDSub  does  what  it  can  to  “color  in” 
each  image  near  its  perimeter.  With  this  option  TRUE,  each  output  image  has  a  border 
of  zeros  inside  it  the  width  and  height  of  the  SE.  That  is,  the  transformed  area  of  each 
output  image  is  smaller  than  the  actual  image  dimensions.  This  is,  in  a  sense,  a  more 
accurate  result  than  the  zero  padded  default.  To  zero  pad  the  input  permits  the  program 
to  transform  the  border  region,  but  it  does  this  on  the  assumption  that  the  original  scene 
was  black  outside  the  image  sequence.  This,  of  course,  is  almost  never  true.  Thus,  the 
border  region  is  inaccurately  transformed.  Use  this  switch  if  accuracy  is  more  important 
than  having  an  image  sequence  that  is  “colored  in”  out  to  the  boundary. 

Notes:  This  is  all  internal  to  MorphSDSub.  The  input  and  output  images  are  all  Ix  by  ly 

independent  of  this  flag. 

MorphSDSub  always  zero  pads  in  the  z-dimension.  That  is  it  always  catenates  enough 
zero  images  to  the  beginning  and  and  of  the  sequence  so  that  the  number  of  images 
output  equals  the  number  of  images  input. 

LThresh  lower  binary  threshold. 

UTHresh  upper  binary  thrshold. 

These  two  parameters  will  e.xtract  binary  images  from  grayscale  images  via  thresh¬ 
olding  if:  UThresh  is  non  zero  AND  SorF  is  SET.  Then  all  pixels  with  grey-levels  in 
[Lthresh,UThresh]  will  be  set  to  WHITE,  all  others  to  BLACK.  Thresholding  occurs  be¬ 
fore  any  morphology.  This  makes  it  possible  to  specify  binary  morphology  on  a  graylevel 
image  sequence.  If  UThresh  >  0  AND  LThresh  >  Uthresh,  MorphSDSub  aborts  with 
an  error.  If  UThresh  is  zero  no  thresholding  is  performed.  Note  that  one  can  use 
MorphSDSub  to  perform  a  simple  threshold  on  an  image  by  specifying  MOp  =  ERODE, 
ImgType  =  GRAIMG,  AutoSE  =  AUTO  sX  =  1,  sY  =  1,  sZ  =  1,  sV=l  and  the  appro¬ 
priate  LThresh  and  UThresh. 
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SE  SE,  sX,  sY,  sZ,  sV,  sorgx,  sorgy,  sorgz 

This  is  a  pointer  to  the  structuring  element  (SE).  Conceptually,  the  SE  fits  in  a  rectan¬ 
gular  box  with  dimensions  sX,  sY,  sZ  in  3-space.  MorphSDSub  treats  the  SE  like  it  is  a 
collection  of  sZ  rectangles,  each  with  sX  elements  in  the  horizontal  dimension  and  sY  in 
the  vertical.  The  origin  of  the  SE  is  specified  by  (sorgx,sorgy,sorgz)  where  (0,0,0)  refers 
to  the  front  (ijs.  first  rectangle)  upper  lefthand  comer  of  the  box.  The  origin  must  lie 
within  the  endossng  box  or  MorphSDSub  aborts.  The  SE  is  stored  as  a  list  of  ints  in 
plane-row-major  order.  That  is  the  first  int  is  the  front  upper  lefthand  comer  of  the  SE 
and  the  last  int  is  the  back  lower  righthand  comer  of  the  SE.  Sequentially  through  the 
int  list  is  the  first  row  of  the  first  plane  of  the  SE,  followed  by  the  second  row  of  the  first 
plane,  etc.,  until  the  first  plane  is  specified,  then  comes  the  second  plsine  in  row-major 
order,  and  so  on. 

sZ  is  a  particularly  important  parameter.  MorphSDSub  uses  this  value  as  the  z  depth  of 
its  3D  image  buffer. 

The  3D  SE  scans  the  image  sequence  in  3D  moving  window  fashion.  That  is,  the  SE 
moves  systematically  over  the  entire  3D  image  so  that  the  origin  of  the  SE  coincides 
with  each  pixel  in  the  image  exactly  once.  The  way  I  have  set  up  the  program,  for  each 
position  of  the  moving  window  scan,  consecutive  planes  of  the  3D  SE  lie  in  consecutive 
2D  images  from  the  sequence.  To  wit,  SE  planes  and  image  (voxel)  planes  coincide.  At 
any  point  in  the  scan,  the  output  image  is  in  the  same  sequential  position  as  the  image 
that  contains  the  SE  origin. 

All  seven  of  the  above  parameters  can  be  supplied  to  MorphSDSub  by  the  companion 
routine  GetSE().  The  functionality  of  GetSE  was  made  separate  from  MorphSubSD 
because  a  calling  program  may  need  to  know  the  structure  of  the  SE  (in  particular  its 
z-dimension  size)  to  set  up  image  10.  (E.g.,so  it  knows  which  output  image  corresponds 
to  which  input  image). 

A  more  complete  description  of  the  structure  of  the  SE  is  given  below. 

SEType  specifics  whether  the  structuring  element  is  to  be  interpreted  as  a  gray-level  SE  or  a 
binary  SE.  (See  below  for  the  distinction.)  Its  possible  values  are: 


mnemon 

hex 

dec 

action 

GRASE 

0x0000 

0 

gray-level  SE 

BINSE 

0x0100 

256 

binary  SE 

SorF  Specifies  whether  the  morphological  operation  is  to  be  of  the  “set”  type  or  “function” 
type.  (See  below  for  the  distinction.)  Its  possible  values  are: 


mnemon 

hex 

dec 

action 

SET 

0x0000 

0 

set  operation 

FUNCT 

0x0400 

1024 

function  operation 

Rank  This  is  meaningful  only  if  Mop  ==  RANK.  Then  the  parameter  indicates  the  order  of  the 
filter.  The  rank  option  of  MorphSDSub  is  actually  an  order  statistic  filter.  To  compute 
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the  order,  MorpbSDSub  counts  the  number  of  pixels  in  the  3D  suppoyi  of  the  SE.  In 
a  gray-ievel  SE  that  is  all  values  greater  than  or  equal  to  zero.  We  diu  not  define  a 
hit-or-miss  rank  filter.  Therefore,  to  compute  the  support  of  a  binary  SE,  MorphSDSub 
changes  the  zeros  in  a  binary  SE  into  -I’s  (DON’T  CARES).  Then  all  the  rank  routine 
considers  the  support  to  be  all  SE  elements  ZERO. 

If  Rank  ==  0,  MorphSDSub  applies  a  median  filter.  If  Rank  ==  1  it  does  a  dUation.  If 
Rank  ==  support  of  SE,  then  MorphSDSub  does  an  erosion.  Otherwise,  MorphSDSub 
does  an  order  statistic  filter  of  order  Rank. 

User  supplied  I/O  routines; 

Routine  MorphSDSub  operates  on  a  3D  image  as  a  sequence  of  2D  images.  The  sequence 
can  be  either  a  time  sequence  of  2D  images  or  “plane  slices”  of  voxels.  Because  there  are 
few  workstations  that  can  load  a  large  3D  image  into  memory  all  at  once,  MorphSDSub 
scrolls  through  the  sequence  keeping  only  the  minimum  number  in  memory  at  once.  (The 
minimum  number  is  SZ,  the  z-dimension  of  the  SE.) 

I  wanted  MorphSDSub  to  be  independent  of  any  particular  image  file  format.  This  meant 
that  MorphSDSub  could  not  easily  do  its  own  file  I/O.  However,  because  of  the  size  of 
3D  images,  the  3D  morphology  program  needs  to  read  images  successively  from  files. 
Thus,  I  designed  MorphSDSub  to  “call  back”  to  its  host  program  for  the  next  image  in 
the  input  sequence.  Likewise,  it  calls  back  to  output  an  image.  This  approach  lets  a 
user  write  a  host  program  to  do  file  I/O  on  any  format  he  likes  and  call  MorphSDSub 
to  operate  on  them.  To  do  this,  a  user  needs  to  write,  along  with  the  host  program,  an 
image  input  routine  and  an  image  output  routine.  The  user  passes  the  addresses  of  these 
two  “call  back  procedures”  to  MorphSDSub  during  its  function  call.  If  Imageln()  is  the 
input  procedure  and  ImageOut()  the  output  procedure  that  the  user  has  written,  Then 
the  host  program  calls  Morph3DSub()  like  this: 

Mocph3DSub(  MorphOp,  Imagein,  InParams,  ImageOut,  OutParams,  ... 

InParams  and  OutParams  are  pointers  to  structures  that  contain  all  the  data  necessary 
for  Imageln()  and  ImageOut()  to  work,  yet  about  which  Morph3DSub()  does  not  need 
to  know. 

MorphSDSub  receives  the  address  of  the  image  input  routine  in  variable  Inputimage. 
It  gets  the  address  of  the  output  routine  in  Outputimage.  The  local  input  and  output 
data  structure  pointers  are  passed  in  Instruct  and  OutStruct,  respectively.  Whenever 
MorphSDSub  needs  a  new  image,  it  calls 

lOError  =  Inputlmage(  In,  NumInReqs,  &eof,  Instruct ); 

Whenever  MorphSDSub  is  ready  to  output  an  image,  it  calls 

lOError  =  OutputImage(  Out,  NumOutReqs,  OutStruct ); 
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MorpbSDSub  expects  the  call  back  procedures  to  have  the  following  structure: 


int  Image!n(  In,  NumReqs,  eof,  Instruct ) 
byte  *In; 
int  NumReqs; 
int  *eof; 

struct  lOS  *InStruct; 

{ 

1.  Figure  out  name  of  next  input  file; 

2.  Read  next  image  file  and  set  *eof  =  FALSE; 

(If  there  was  an  error  in  the  read,  return  with  error.) 

3.  If  the  file  could  not  be  read  be  cause  there  are  no  more 
images  set  *eof  =  TRUE  and  return  with  no  error; 

4.  Decode  image  and  extract  luminance  information  from  image 
(lumi.  extraction  not  necessary  if  image  is  grayscale); 

5.  Copy  grayscale  image  in  row  major  order,  one  byte  per  pixel, 
to  consecutive  locations  starting  at  In; 

6.  Return  with  no  error. 

} 

and 

int  ImageOut(  Out,  NumReqs,  OutStruct ) 
byte  *Out; 
int  NumReqs; 
struct  lOS  *OutSttuct; 

{ 

1.  Figure  out  name  of  next  output  file; 

2.  Copy  grayscale  image  in  row  major  order,  one  byte  per  pixel, 
from  consecutive  locations  starting  at  Out; 

3.  Construct  output  image  in  appropriate  format; 

4.  Write  image  to  file; 

5.  If  write  was  sucessful,  return  with  no  error. 

(Else  return  with  error.) 

} 


The  names  of  the  two  callback  procedures  are  defined  by  the  user.  For  siuiplicity  of 
description,  I  will  assume  they  have  the  names  Imagela()  and  ImageOui(). 

The  first  argument,  In,  of  the  ImageIn  is  a  pointer  to  an  image  array.  MorphSDSub 
allocates  this  array  and  passes  its  address  to  the  ImageIn  through  this  variable.  ImageIn 
reads  and  decodes  an  image  file.  It  extracts  the  luminence  component  and  writes  the 
gray-level  pixels  (one  byte  per  pixel)  in  row-major  order  starting  at  the  address  iu  In. 

NumReqs  is  the  number  of  input  requests  that  have  been  made  by  IvIorphSDSub.  When 
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Imagein  is  called  the  first  time  NumReqs  ==  1.  Imagein,  may  or  may  aot  use  this 
information. 

The  third  argument,  eof,  is  a  pointer  to  a  flag  in  MorphSDSub.  Imagein  must  -^rite  a  zero 
there  whenever  it  inputs  an  image.  If  there  are  no  more  images  in  the  input  sequence, 
Imagein  must  write  a  nonzero  number  there. 

Instruct  is  a  pointer  to  a  structure  that  includes  information  necessary  for  Imagein  to 
work  but  is  not  directly  needed  by  MorphSDSub.  Instruct  is  Imageln’s  link  back  to  the 
host  program.  For  example,  Instruct  my  contmn  image  header  and  colormap  information, 
filename  tags,  or  information  to  be  conveyed  to  ImageOut. 

The  first  argument.  Out,  of  the  ImageOut  is  a  pointer  to  an  image  array.  MorphSDSub 
allocates  this  array  and  passes  its  address  to  the  ImageOut  through  this  variable.  Ima;- 
geOut  creates  an  output  file  incorporating  the  image  in  this  array. 

NumReqs  is  the  number  of  output  requests  that  have  been  made  by  MorphSDSub.  When 
ImageOut  is  called  the  first  time  NumReqs  ==  1.  The  output  routine,  ImageOut,  may 
or  may  not  use  this  information. 

OutStruct  is  a  pointer  to  a  structure  that  includes  information  necessary  for  ImageOut 
to  work  but  is  not  directly  needed  by  MorphSDSub.  OutStruct  is  ImageOut’s  link  back 
to  the  host  program.  For  example,  OutStruct  my  contain  image  header  and  colormap  in¬ 
formation,  filename  tags,  or  information  to  be  conveyed  to  Imagein.  It  can  be  convenient 
to  have  OutStruct  ==  Instruct. 

Both  callback  procedures  return  a  functional  value.  A  zero  returned  indicates  normal 
completion  of  the  routine.  A  nonzero  value  indicates  an  I/O  error  that  should  abort  the 
entire  operation. 

If  you  want  to  write  a  host  program  for  MorphSDSub,  I  highly  recommend  that  you  look 
at  the  example  I/O  routines  Inputlmage()  and  OutputImage()  in  morphSd.c  morphSd.c 
is  a  sun  rasterfile  based  host  program  for  MorphSDSub. 

Structuring  element  specifications: 

A  companion  routine,  GetSE,  loads  or  creates  structuring  elements  for  MorphSDSub.  It 
returns  a  pointer  to  the  SE  list.  Its  calling  sequence  is  defined  as  follows: 

int  *GetSE(  SEName,  AutoSE,  SEType,  SorF,  sX,  sY,  sZ,  sV,  sorgx,  sorgy,  sorgz  ) 
char  *SEName;  filename  of  SE  file  or  NULL 

int  AutoSE;  nonzero  create  SE  internally 

int  *SEType;  binary  or  graylevel 

int  *SorF;  set  or  function 

int  *8X,*sY,*sZ,*8V;  end  box  size  and  max  gray  level 

int  *sorgx,*sorgy,*sorgz;  origin  cdts. 

{ 

body  of  routine 

} 
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SEName:  This  is  the  hlename  of  a  SE  file.  If  the  environment  variable  SEPATH  is  defined,  the 

value  of  it  is  prepended  to  the  filename  prior  to  opening  the  file.  If  this  value  is  NULL, 
GetSE()  assumes  that  it  is  being  asked  to  roll  its  own  SE. 

A  structuring  element  file  is  an  ASCII  file  of  integers  separated  by  spaces.  The  first  three 
numbers,  x,  y,  z  are  the  horizontal,  vertical,  and  depth  dimensions  in  pixels  of  the  smallest 
3D  box  that  will  cover  the  structuring  element.  Numbers  x,  y  and  z  must  be  >  0.  The 
next  three  numbers,  i,  j,  k  are  the  horizontal,  vertical,  and  depth  coordinates,  respectively, 
of  the  SE  origin.  IMPORTANT:  The  origin  is  expected  to  be  in  the  covering  box.  If 
not,  MorphSubSD  aborts.  The  front  upper  left  hand  corner  of  the  box  has  coordinates 
(0,0,0);  the  back  lower  right  is  (x-l,y-lz-l).  Following  the  first  six  integers  are  x*y*z 
integers  separated  by  spaces.  These  numbers  are  the  SE  elements.  Their  interpretation 
depends  on  the  morphological  operation  being  performed. 

Negative  SE  elements  are  ALWAYS  treated  as  logical  DON’T  CAREs.  That  is,  when 
the  operation  is  in  progress,  image  pixels  under  negative  SE  elements  are  ignored.  Thus, 
the  support  of  the  SE  is  limited  to  those  elments  that  are  nonnegative.  This  permits  the 
creation  of  odd-shaped  and  multiply  connected  SE’s  or  the  placement  of  the  SE  origin 
outside  the  body  of  the  SE.  If  ImgType  is  binary,  (i.e.  pixels  grouped  as  zero  and  not 
zero),  and  if  SEType  is  binary,  then  the  SE  is  used  to  perform  a  hit-or-miss  transform. 
In  this  case,  zero  SE  elements  cover  the  “miss”  support  and  positive  (nonzero)  elements 
cover  the  “hit”  support.  The  actual  gray-levels  are  ignored. 

If  ImgType  is  binary,  and  SEType  is  gray  then  the  nonnegative  (both  zero  and  greater 
than  zero)  SE  elements  determine  the  support  of  a  “hit-only”  transform.  That  is,  the 
nonnegative  suport  is  used  as  a  standard  set-type  SE  for  set  (binary)  morphology.  (Of 
course,  the  other  gray-level  info  is  ignored.)  Note:  if  ImgType  is  binary,  then  a  set 
operation  is  performed  by  default  (SorF  is  ignored). 

The  interpretation  of  the  SE  elements  for  ImgType  gray  depends  on  the  flags  SEType 
and  SorF,  just  as  for  morph: 

AutoSE:  If  this  value  is  zero  an  SE  file  is  read  in.  If  it  is  nonzero,  it  tells  GetSE()  to  either  use 

one  of  the  predefined  (canned)  SEs  or  to  create  an  SE.  Here  are  the  possible  values  for 
AutoSE  and  their  implications. 


mnemon  dec 

PLUS  1 

S3X3X3  2 

S5X5X5  3 

CONE  4 

CYLINDER  5 

SPHERE  6 


meaning 

use  3  by  3  by  3  3D  “H-”  shaped  SE 
use  3  by  3  by  3  cube  SE 
use  5  by  5  by  5  quasi  disk  SE 
make  a  conical  SE 
make  cylindrical  SE 
make  spherical  SE 


The  first  three  SEs  are  canned  SEs.  That  is  they  exist  as  data  structures  in  the  compiled 
code  of  Morph3DSub,  The  other  three  SEs  are  built  by  Morph3DSub  to  user  specifica¬ 
tion.  When  any  of  the  six  are  requested  SEType  is  set  to  gray  (SEType  ==  GRASE).  If 
one  of  the  canned  SEs  is  chosen  a  set  operation  is  performed  (SorF  ==  SET)  overriding 
the  input  parameter.  If  CONE,  CYLINDER,  or  SPHERE  is  chosen  a  set  operation  is 
performed  if  sV  ==  0,  and  a  function  operation  is  performed  if  sV  !=  0. 
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PLUS:  This  SE  has  a  voxel  at  its  origin  and  another  attached  to  each  face  of  the  origin 
voxel  for  as  total  of  7  voxel  elements.  It  looks  like  a  in  orthographic  projection  from 
any  x,  y  or  z,  direction. 

S3X3X3:  This  is  simply  a  3x3x3  cube  of  voxels  with  origin  at  the  center. 

S5X5X5:  Extend  each  face  of  the  3x3x3  cube  outward  by  one  layer  of  voxels  and  you  get 
this  SE.  I  looks  like  a  fat,  short-armed  3D 

CONE:  GetSE  makes  an  elliptical  cone  as  follows:  The  origin  is  a  single  voxel  at  the 
center  of  the  SE.  The  axis  of  the  cone  is  parallel  the  r-axis  (it  is  perpendicular  to  the 
image/voxel  planes).  Each  end  of  the  cone  is  an  sX  by  sY  elliptical  disk  (circular  disk,  if 
sX  =:  sY).  sX,  sY,  and  sZ  must  be  odd  and  greater  than  or  equal  to  1.  Each  end  of  the 
cone  b  (sZ-i)/2  image  planes  away  from  the  plane  of  the  origin.  For  example,  if  sZ  =  3, 
the  encL^  of  the  cone  are  in  the  planes  adjacent  to  the  origin  plane.  If  sZ=5  there  is  one 
image  plane  between  the  end  plane  and  the  origin  plane  at  each  end.  The  slope  of  the 
cone  in  the  xz  plane  is  sX/sZ  and  in  the  yz  plane  the  slope  is  sY/sZ.  SE  planes  between 
the  origin  and  the  ends  are  ellipical  disks  with  semimajor  and  semiminor  axes  determined 
by  the  slopes.  The  cone  is  actually  an  hourglass  in  shape.  This  shape  can  track  pixels 
in  a  time  sequence  that  have  a  maximum  velocity  in  {x,y)  given  by  (sX/sZ,sY/sZ).  If  a 
graylevel  value,  sV  >  0,  is  requested,  the  voxels  along  the  cone  £ixis  are  given  value  sV. 
The  graylevels  assigned  to  other  voxels  decrease  as  the  .square  root  of  the  radius  to  zero 
at  the  edges  of  the  cone. 

CYLINDER:  GetSE  makes  an  elliptical  cylinder  as  follows:  First,  it  makes  an  elliptical 
disk  with  a  diameter  of  sX  pixels  horizontally  and  sY  pixels  vertically.  sX  and  sY  must 
be  odd  and  greater  than  or  equal  to  1.  sV  is  the  gray  level  of  the  center  voxel.  If  sY  > 
0  is  requested  the  gray  level  of  the  voxels  decrease  as  the  square  root  of  the  radius  to 
zero  at  the  edge  of  the  disk.  Then,  the  disk  is  copied  sZ  times.  The  origin  of  the  SE  is 
placed  at  its  very  center  (center  of  the  disk  at  sZ/2).  A  cylindrical  SE  can  track  shapes 
in  a  time  sequence  larger  than  its  own  cross-section  that  are  not  moving  over  sZ  frames. 
A  cylinder  of  diameter  1  can  track  stationary  pixels. 

SPHERE;  This  is  a  spheroid  of  dimensions  sX  by  sY  by  sZ  It  is  made  by  GetSE  in  a 
fashion  analogous  to  the  construction  of  a  single  identiacal  to  one  slice  of  the  cylinder.  If 
sV  is  >  0  sV  is  the  value  of  the  center  pixel  and  the  gray-values  decrease  as  the  square 
root  of  the  radius  to  zero  at  the  edges.  The  graylevel  sphere  will  approximate  densities 
in  a  3D  voxel  image. 

Note  that  when  AutoSE  !=  0,  the  values  of  SEType  and  SorF  are  ignored. 
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Abstract 

The  CADDMAS  project  was  reviewed.  The  architecture  of  the  prototype 
of  the  proposed  full  scale  system  appears  sound,  although  some  potential 
problems  will  probably  be  found  as  the  users  of  the  system  gain  more 
experience.  The  software  development  and  operating  environment  for 
CADDMAS  is  adequate, but  less  than  ideal.  The  ideal  software  environment 
will  probably  not  exist  for  several  years  until  better  software  standards  are 
developed  for  parallel  computer  systems. 

Introduction 

The  CADDMAS  project  at  Arnold  Engineering  Development  Center  was 
reviewed  during  the  summer  of  1991  under  the  sponsorship  of  the  United 
States  Air  Force,  Office  of  Scientific  Research,  Summer  Faculty  Research 
Program.  Mr.  Tom  Tibbals  of  Sverdrup  Technology,  Inc.  was  the  focal  point 
for  the  program.  The  review  covered  three  aspects  of  the  CADDMAS  project: 
1)  the  appropriateness  of  the  proposed  architecture,  2)  the  potential  for 
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running  existing  software  on  CADDMAS,  and  3)  the  potential  for  assigning 
the  CADDMAS  resources  dynamically. 


Description  of  CADDMAS 

CADDMAS  is  an  acronym  for  Computer  Assisted  Dynamic  Data 
Monitoring  and  Analysis  System,  The  proposed  system  will  acquire  data  in 
digital  form  from  sensors  mounted  on  experimental  turbine  engines  while  the 
engines  are  tested.  CADDMAS  will  also  analyze  the  data,  and  report  the 
results  in  real-time.  Details  of  a  scaled  dowr.,  proof-of-concept  system  have 
been  reported  elsewhere  [1]. 

CADDMAS  is  a  multi-processor  computer  system.  Several  types  of 
processor  elements  are  used  in  the  system.  Most  of  these  processing  elements 
are  replicated  through  out  the  system. 

One  processing  element  is  a  dedicated  data  acquisition  system  based  on 
the  Motorola  DSP56001  DSP  microprocessor.  A  Motorola  DSP96002  DSP 
microprocessor  will  be  used  in  the  proposed  system.  The  system  acquires  data 
from  a  turbine  sensor,  does  some  analysis  on  the  data,  and  reports  the  results 
to  the  rest  of  CADDMAS.  The  proposed  system  will  also  have  built-in  self-test 
capability. 
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Another  processing  element  is  based  on  the  INMOS  Transputer  [2].  The 
Transputer  is  a  family  of  microprocessors  that  can  be  used  to  construct  MIMD 
(Multiple  Instruction  stream.  Multiple  Data  stream)  parallel  computer  systems. 
Several  of  the  Transputer  elements  have  Zoran  ZR34325  vector  signal 
processors  as  co-processors  [31. 

Data  can  be  saved  on  one  or  more  data  storage  elements.  These  systems 
can  save  data  on  either  hard  disk  or  cassette  tape. 

The  user  of  the  system  communicates  with  the  processing  elements 
through  a  personal  computer.  Results  can  be  displayed  on  the  personal 
computer's  monitor.  However,  the  system  also  has  several  high  performance 
graphic  monitors  for  displaying  graphs  [1]. 

All  of  these  processing  elements  are  connected  together  with  serial 
communication  links  based  on  the  Transputer  protocol.  Some  of  these  links 
are  static.  Others  are  dynamic  and  can  be  altered  by  writing  the  appropriate 
bit  pattern  into  an  INMOS  IMS  COxx  family  Programmable  Link  Switch  [2]. 

CADDMAS  will  replace  a  current  system  where  data  is  recorded  on 
analog  tape.  The  analog  tape  data  is  converted  to  digital  code  off  line  and 
analyzed  on  mainframe  computers  or  vector  supercomputers. 
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The  CADOMAS  Architecture 


The  review  of  the  architecture  was  based  on  reading  some  of  the 
literature  cited  in  this  report  and  s^'me  internal  reports  that  were  not  cited. 
The  review  was  also  based  on  several  discussions  with  Mr.  Tom  Tibbals  of 
Sverdrup,  and  Mr.  Ted  Bapty  and  Mr.  Ben  Abbott  of  Vanderbilt  University,  the 
engineers  who  have  done  most  of  the  design  and  development  work  on 
CADDMAS.  Discussions  were  also  held  with  Mr.  Jim  Nichols  of  Sverdrup,  an 
engineer  who  will  probably  use  the  system.  Based  on  the  information 
available  to  the  author,  it  appear  the  proposed  architecture  of  CADDMAS  is 
well  thought  out  and  that  the  engineers  primarily  involved  in  the  project  are 
well  aware  of  some  of  the  potential  problems  and  risks  they  face  developing  a 
final  system  that  will  meet  the  system’s  specifications. 

Several  strategies  have  been  used  to  obtain  the  high  processing  rates 
required  for  CADDMAS.  The  multi-processor  and  parallel  processor  approach 
using  relatively  inexpensive  microprocessor  components  is  cost  effective. 
Using  dedicated  processors  for  data  acquisition  and  analysis  of  some  of  the 
data  that  will  be  required  by  ail  users  of  CADDMAS  is  common  practice  for 
large,  dedicated  data  acquisition  systems.  This  approach  is  indeed 
appropriate  for  CADDMAS.  The  dedicated  processors  are  cost  effective  and 
will  free  up  the  remainder  of  the  system  for  user  specific  data  analysis.  The 
design  engineers  appear  to  be  willing  to  add  more  dedicated  processor 
capability  if,  and  when,  data  processing  functions  common  to  most  CADDMAS 
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users  are  identified.  This  philosophy  is  appropriate,  dearly  within  mainstream 
computer  engineering  thought,  and  strongly  encouraged  by  the  author  of 
this  study. 

The  use  of  vector  co-processors  in  one  or  more  Transputer  elements  is 
also  appropriate  and  advantageous.  The  cost  of  adding  a  high  performance 
integrated  circuit  vector  processor  is  relatively  small.  Vector  processors  have 
definite  advantages  for  handling  fine  grain  parallel  processing  algorithms, 
such  as  some  types  of  array  processing.  A  MIMD  architecture  is  more 
appropriate  for  handling  medium  and  coarse  grain  parallel  algorithms,  such 
as  processing  similar  data  from  several  processor  channels.  Some  of  the 
engineers  who  will  use  CADDMAS  have  already  had  experience  running 
software  on  vector  processors,  such  as  the  Cray  supercomputers.  The  vector 
co-processors  in  some  Transputer  elements  will  provide  a  vehicle  for 
expressing  fine  grain  parallelism. 

MIMD  parallel  processing  using  low  cost  integrated  circuit  technology  is 
clearly  an  appropriate  design  strategy  for  the  CADDMAS  project  because  of 
the  project's  high  data  processing  rate  requirements.  The  extensibility  of  the 
design  is  also  desirable  since  the  demand  for  data  processing  capability  will 
probably  expand  as  the  users  of  CADDMAS  gain  more  experience  with  multi¬ 
processor  data  processing.  The  design  engineers  are  well  aware  that  the 
major  problem  with  using  a  parallel  processing  architecture,  be  it  vector 
processing  or  MIMD,  is  writing  effective  and  efficient  software  for  the 
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systems.  Some  of  the  issues  involved  are  discussed  in  a  later  section  of  this 
report.  However,  two  trends  in  the  development  of  technology  are 
minimizing  this  risk. 

One  trend  is  that  the  performance/price  ratio  of  semiconductor 
integrated  circuit  technology  is  halving  approximately  every  18  months.  This 
trend  will  probably  continue  for  the  immediate  future.  As  a  result  of  this 
trend,  even  the  inefficient  use  of  the  processing  power  of  MIMD  hardware  is 
becoming  more  and  more  cost  effective. 

The  second  favorable  technology  trend  is  that  several  novel  approaches 
to  more  effective  software  development  for  MIMD  systems  are  being 
explored.  The  low  cost  of  microprocessor-based  MIMD  architectures  has 
generated  a  lot  of  research  into  effective  programming  techniques  for  these 
system.  Some  of  these  approaches  are  discussed  later. 

While  some  risk  is  involved  in  using  a  parallel  architecture,  it  is  a 
reasonable  risk  given  the  low  hardware  costs  associated  with  MIMD  design. 
Moreover,  improving  technology  is  decreasing  the  risk. 

The  choice  of  the  Transputer  for  the  main  processing  element  is  also  an 
appropriate  choice.  There  are  several  microprocessor  chips  that  could  be  used 
in  a  MIMD  system.  Intel  uses  its  own  80386  and  860  chips  in  the  MIMD  systems 


it  sells  through  its  Intel  Supercomputer  Systems  Division.  The  Transputer  has 
several  features  that  make  it  attractive  for  MIMD  systems  [2]. 

A  linear  arrangement  of  Transputers  is  probably  appropriate  for  most  of 
the  data  processing  of  CADDMAS  since  this  arrangement  corresponds  to  the 
physical  arrangement  of  the  sensors  on  a  turbine  engine.  However,  the 
flexibility  built  into  the  connections  of  the  Transputers  in  CADDMAS  is  clearly 
appropriate.  The  Programmable  Link  Switch  is  an  appropriate  strategy  for 
quickly  moving  sensor  data  to  a  graphical  display.Some  users  of  the  system 
may  wish  to  cross  correlate  data  between  sensors.  The  Programmable  Link 
Switch  in  CADDMAS  gives  the  user  the  ability  to  exchange  data  between 
processors  directly,  rather  than  transmitting  the  data  through  another 
processor.  This  feature  will  improve  performance  if  used  properly.  Exactly 
which  Transputer  elements  will  need  to  be  connected  through  the 
Programmable  Link  Switch  will  probably  need  to  be  settled  after  the 
CADDMAS  users  have  gained  experience  with  the  system. 

Using  an  IBM  PC  style  personal  computer  as  a  console  for  the  system  is 
also  appropriate.  A  workstation  would  provide  a  higher  performance  user 
interface  with  superior  graphics  capability.  Unfortunately,  most  of  the 
CADDMAS  users  are  probably  unfamiliar  with  workstations  and  the  UNIX 
operating  system  that  runs  on  most  workstations.  The  users  are  more  likely  to 
be  comfortable  with  an  MS  DOS,  PC  style  interface.  They  may  also  wish  to  run 
PC  programs  such  as  Lotus  1-2-3.  The  selection  of  a  PC  interface  should  not 
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seriously  degrade  system  performance  and  will  probably  improve  its  utility. 
CADDMAS  has  high  performance  graphics  monitors  as  part  of  its  architecture. 
Most  users  will  probably  not  need  the  higher  processing  performance  of  a 
workstation  since  there  should  be  adequate  capability  from  the  Transputer 
elements.  Further,  the  performance  of  PCs  is  approaching  that  of 
workstations  as  technology  improves.  This  trend  is  minimizing  the 
performance  difference  between  PCs  and  workstations. 

The  engineers  developing  CADDMAS  are  concerned  about  the 
compatibility  of  CADDMAS  software  with  new  processing  element 
technology.  This  concern  is  indeed  valid.  Most  of  the  these  issues  are  covered 
in  a  later  section  of  this  report.  It  is  important  to  note  that  INMOS  has 
introduced  higher  performance  versions  of  the  Transputer  chips  as 
semiconductor  technology  has  improved  [2].  The  improved  chips  have 
machine  code  compatibility  with  the  older  chips.  Other  compatibility  issues 
are  discussed  in  a  later  section. 

There  is  one  potential  problem  that  AEDC  may  have  with  CADDMAS. 
The  DFT  is  based  on  several  assumptions  about  the.  vibrations,  the  most 
important  of  which  is  that  the  vibrations  are  sinusoidal  [4],  [5].  Sinusoidal 
vibrations  always  occur  in  homogeneous  materials  with  rectangular 
boundaries  (for  two  dimensions)  or  plane  boundaries  (for  three  dimensions) 
[6j.  Any  curvature  in  a  boundary  will  tend  to  focus  the  spacia!  waves  and 
produce  standing  waves  in  space.  The  time  waveform  at  any  point  in  space 
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will  still  be  sinusoidal.  However,  the  spacial  waveform  will  not  be  exactly 
sinusoidal  even  though  it  may  be  close  to  sinusoidal.  Using  a  DFT  to  analyze  a 
time  waveform  at  any  point  in  space  should  introduce  no  problem,  provided 
the  DFT  is  interpreted  correctly.  Correlating  the  time  waveforms  from  two 
points  in  space  may  produce  problems  unless  the  focusing  effects  of  boundary 
curvature  are  taken  into  account. 

The  inhomogenaities  and  anisotropic  behavior  of  some  composite 
materials  will  introduce  additional  boundaries  that  will  produce  additional 
reflections.  The  additional  boundaries  can  introduce  additional  harmonics 
that  are  not  integer  multiples  of  the  harmonics  produced  by  the  physical 
boundaries  of  the  materials  even  if  all  the  spacial  harmonics  are  sinusoidal. 
The  inhomgenaities  can  also  produce  focusing  that  will  tend  to  produce  non- 
sinusoidal  standing  waves  in  space. 

The  issues  involved  are  discussed  in  more  detail  in  another  publication 
[7].  The  effects  of  curvilinear  boundaries,  inhomgeneous  materials,  or 
anisotropic  materials  may  produce  the  demand  for  other  discrete  transform 
software  that  is  more  appropriate  for  the  vibrations  that  may  be  encountered. 

Running  Existing  Software  on  G^DDMAS 

Any  software  conforming  to  the  standards  of  a  common,  higher  level 
programming  language  can  potentially  be  re-compiled  to  run  on  a  Transputer 
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with  little  or  no  modification.  (A  standard  is  assumed  to  be  set  by  ANSI  or 
some  other  recognized  standards  agency.  A  common  language  is  assumed  to 
be  one  used  by  engineers  to  program  two  or  more  different  computers. 
Fortran,  C,  Ada,  and  Modula-2  would  be  examples  of  common,  standard 
higher  level  languages.)  The  machine  code  produced  by  the  compiler  may  run 
on  only  one  processing  element  unless  the  compiler  can  spot  parallelism  in  the 
program.  In  many  cases,  the  software  will  need  to  be  rewittten  to  distribute 
the  software  over  two  or  more  processing  elements  even  if  the  compiler  can 
spot  some  parallelism  in  the  software. 

The  current  support  for  programming  parallel  computer  systems  differs 
from  the  ideal.  These  differences  will  probably  exist  for  many  years  until  more 
research  is  done  into  how  to  program  parallel  computer  systems  and  better 
standards  are  developed  for  programming  languages.  Some  of  the  issues  are 
discussed  below.  More  information  is  given  in  [8]. 

Ideal  Software  Support 

It  would  be  desirable  for  a  compiler  to  take  an  existing  program  written 
in  a  standard  higher  level  programming  language  and  automatically  identify 
the  parallelism  in  the  program.  Indeed,  some  progress  has  been  made  in 
developing  compiler  algorithms  that  will  identify  parallelism  [9].  Whether 
any  commercial  compiler  available  for  the  Transputer  [10]  takes  advantage  of 
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these  techniques  is  unknown  to  the  author.  A  compiler  written  at  Georgia 
Tech  is  reported  to  detect  parallelism  [11], 

It  is  unlikely  that  even  the  best  compiler  will  ever  be  able  to  identify 
parallelism  in  a  program  if  the  programmer  uses  an  inherently  sequential 
algorithm.  A  good  higher  level  programming  language  for  programming 
parallel  processors  needs  paradigms  for  identifying  parallelism. 
Unfortunately,  ai!  current  standards  assume  sequential  processing.  Most 
Transputer  compilers  for  common  standard  higher  level  programming 
languages  have  extensions  to  the  standard  to  support  expressing  parallelism. 
The  use  of  these  extensions  will  complicate  recompiling  the  rewritten 
software  on  another  vendor's  compiler.  It  will  also  complicate  the 
recompilation  of  the  software  on  a  standard  compiler  once  good  standards 
for  identifying  parallelism  are  accepted  and  adopted. 

Existing  Software  Support 

Ada,  C,  Fortran,  and  Modula-2  compilers  are  all  available  for  the 
Transputer.  So  are  compilers  for  less  common  languages  [10]. 

Based  on  the  author's  experience  transporting  Fortran,  Ada,  and  C 
programs  from  one  machine  to  another,  Ada  programs  will  present  the 
fewest  re-compilation  problems,  and  Fortran  programs  will  present  the  most 
problems.  Ada  seems  to  have  the  most  machine  independent  paradigms,  and 
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Fortran  seems  to  have  the  most  machine  dependent  paradigms.  (Character 
manipulation  in  Fortran  is  a  good  example  of  a  paradigm  that  is  both  machine 
specific  and  covered  by  a  poor  standard.) 

Two  common  higher  level  languages,  Ada  and  Modula-2,  do  have  some 
standard  mechanisms  that  can  potentially  support  parallel  programming. 
Both  Ada  and  Modula-2  support  the  concept  of  tasks  [12],  [13].  Alsys  has 
developed  an  Ada  compiler  for  the  Transputer  that  does  support  parallel 
processing  with  static  tasks  [14].  Tasks,  or  coprocesses  as  they  are  sometimes 
called,  were  originally  intended  to  support  concurrent  programming  on  single 
processor  machines.  However,  there  are  many  similarities  between  parallel 
processing  where  messages  are  exchanged  between  the  processors,  and 
concurrent  processing. 

Tasks  are  similar  to  procedures  or  functions  in  the  way  they  are  written. 
Tasks  can  be  called  and  they  can  return  values.  There  is  a  fair  amount  of 
overhead  entering  and  leaving  tasks.  Tasks,  as  a  programming  concept  for 
parallel  programming,  are  more  suitable  for  medium  and  coarse  grain  parallel 
algorithms. 

The  proposed  new  Fortran  standard,  FORTRAN  90,  is  reported  to  have 
support  for  programming  parallel  processors  [15].  The  standard  is  reported  to 
support  the  programming  of  vector  processors.  Whether  the  proposed 
standard  can  support  medium  or  coarse  grain  parallelism  remains  to  be  seen. 
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No  Transputer  compilers  that  the  author  is  aware  of  have  extensions  to 
support  parallel  processing  on  the  vector  co-processor  in  CADDMAS. 
Subprograms  {functions  or  procedures)  are  available  that  can  be  called  to  use 
the  co-processor.  Standard  C  compilers  support  a  higher  level  construct  called 
pointers  which  can  be  used  to  directly  control  a  memory-mapped  co-processor 
without  the  overhead  of  subprogram  calls  and  returns  [16].  Standard  Ada 
compilers  can  support  direct  control  of  memory-mapped  co-processors 
through  representation  specifications  [12]. 

Utilizing  The  Capacity  of  CADDMAS 

Utilizing  the  capacity  of  CADDMAS  is  another  area  where  the  ideal 
environment  differs  from  the  existing  environment. 

Ideal  Operating  Environment 

In  the  ideal  operating  environment,  tasks  should  be  assigned 
dynamically  to  processing  elements  that  are  available  to  execute  a  task. 
Multiple  tasks  should  be  assigned  to  one  processing  element  if  appropriate.  A 
task  should  be  divided  and  distributed  over  two  or  more  processing  elements 
if  the  extra  proce' Jng  elements  become  available.  A  processing  element  that 
fails  should  be  detected  and  taken  out  of  service. 
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The  dynamic  assignment  of  tasks  requires  a  control  program,  or 
operating  system.  It  also  requires  a  compiler  that  will  produce  tasks  in  a  form 
acceptable  to  the  operating  system  so  that  the  tasks  can  be  assigned 
dynamically.  Many  techniques  have  been  proposed  for  generating  and 
assigning  dynamic  tasks  in  parallel  computer  systems.  There  seems  to  be  no 
common  agreement  on  the  best  way  to  do  so.  It  will  probably  be  some  time 
before  widely  accepted  standards  exist  in  this  area.  Since  the  issues  are  closely 
related  to  some  of  the  issues  involved  in  generating  standard  programming 
languages  for  parallel  computer  systems,  standards  for  programming 
languages  will  probably  be  developed  first.  Some  of  the  proposed  solutions 
are  discussed  in  more  detail  elsewhere  [8]. 

Current  Operating  Environment 

The  Transputer  does  have  several  operating  systems  available  for  it  [10]. 
At  least  one,  Helios,  supportsthe  dynamic  assignment  of  tasks  [17],  [18]. 

In  the  most  common  Transputer  operating  environment,  there  is  no 
control  program  per  se.  Each  Transputer  does  have  built  into  its  instruction  set 
the  ability  to  initiate,  susperd,  and  switch  tasks.  It  also  has  the  ability  to 
exchange  messages  between  tasks  and  load  programs  into  itself  through  one 
of  Its  communications  ports  after  restart  [2].  A  communication  program  on  a 
host  computer  loads  the  tasks  into  each  Transputer  on  restart. 
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Tasks  are  assigned  statically  by  the  programmer  in  most  current 
Transputer  environments.  The  number  of  tasks  and  the  Transputer  they  are 
assigned  to  is  determined  by  the  programmer.  These  decisions  are  made 
when  the  tasks  are  written  or  compiled,  depending  on  the  compiler.  Tasks 
must  be  reassigned  and  the  software  must  be  recompiled  if  a  Transputer  is 
taken  out  of  service  for  any  reason. 

Conclusions 

1.  The  current  system  for  acquiring  and  analyzing  experimental  turbine 
engine  test  data  is  antiquated  and  clearly  needs  to  be  replaced  with  a 
real-time,  computer-based  data  acquisition  and  analysis  system.  The  real 
issue  is  whether  CADDM  AS  is  a  sound  solution. 

2.  The  architecture  of  CADDMAS  is  clearly  appropriate.  The  engineers 
designing  the  system  are  approaching  the  problem  in  a  realistic  and 
responsible  manner.  The  computer  architects  are  using  appropriate  off 
the  shelf,  state  of  the  art  technology  and  are  designing  only  what 
cannot  be  purchased  as  a  turnkey  product. 

3.  The  designers  of  CADDMAS  are  attempting  to  take  advantage  of  state 
of  the  art  computer  architecture,  especially  parallel  processing.  They 
really  have  no  choice  but  to  use  a  multi-processor  system  to  obtain  the 
computation  power  needed.  They  have  identified  dedicated  tasks  and 
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used  co-processors  or  dedicated  processors  for  these  functions. 
Dedicated  processors  and  co-processors  are  proven  design  techniques. 
Parallel  processors  are  being  used  only  for  high  power,  general  purpose 
computation.  Parallel  processing  is  a  proven  hardware  technique, 
although  there  is  some  technological  risk  in  programming  parallel 
processors.  The  designers  clearly  understand  and  are  concerned  about 
these  risks. 

Most  of  the  programming  risk  of  using  parallel  processing  will  be  in 
writing  or  purchasing  software  that  will  take  full  advantage  of  the 
processing  capability  of  CADDMAS.  Some  of  the  application  and 
operating  system  software  will  need  to  be  developed  through  trial  and 
error  as  the  users  of  CADDMAS  learn  more  about  parallel  processing 
technology  and  what  CADDMAS  can  do  to  help  them.  In  the  process,  the 
users  of  the  system  will  probably  learn  more  about  the  problems  they 
want  to  solve  with  CADDMAS  and  will  better  be  able  to  express  their 
wants  and  needs  in  terms  that  can  be  understood  by  computer  architects 
and  programmers.  Research  being  conducted  outside  AEDC  into  the 
best  software  techniques  for  programming  parallel  processing  systems 
will  also  have  a  strong  and  favorable  impact  on  writing  better  software 
for  CADDMAS. 

5.  The  risk  is  not  that  CADDMAS  will  fail.  Rather,  the  risks  are  that  1)  the 
software  will  not  efficiently  use  the  hardware,  2)  the  time  to  develop 
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efficient  software  may  be  excessive,  3)  existing  software  may  not  be  as 
portable  to  CADDMAS  as  desired  by  some  of  its  potential  users,  or  4)  the 
software  may  not  be  as  portable  as  desired  to  any  future  computing 
equipment  that  will  eventually  replace  CADDMAS. 

6.  The  hardware  for  the  proposed  system  is  so  inexpensive,  and  should 
become  even  less  expensive  in  the  future,  that  the  risk  of  inefficiently 
using  the  hardware  of  the  system  can  be  overcome,  to  a  significant 
extent,  by  expanding  the  hardware  of  the  system.  An  advantage  of  a 
good  parallel  processing  architecture  is  its  extensibility.  The  proposed 
CADDMAS  architecture  will  have  some  extensibility. 

7.  In  spite  of  the  potential  software  problems  with  CADDMAS,  the  risk  to 
reward  ratio  is  so  high  that  even  the  worst  case  risk  is  acceptable. 
Indeed,  continuing  to  use  the  current  data  acquisition  and  analysis 
system  or  even  replacing  the  current  analog  data  acquisition  system  with 
the  data  acquisition  co-processor  developed  for  CADDMAS,  would  be 
risker  choices.  These  alternatives  would  continue  the  use  of  obsolete 
technology.  The  current  data  analysis  computing  facilities  need  to  be 
replaced  with  modern,  less  expensive  technology.  There  would  be  little, 
if  any,  reward  for  using  the  obsolete  technology. 

8.  Several  key  software  modules  that  will  assist  a  programmer  writing  data 
analysis  application  programs  have  already  been  written  for  or  adapted 
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to  CADDMAS  as  part  of  the  proof  of  concept  system.  Modules  include 
graphics  display  software,  including  an  extension  to  support  Campbell 
diagrams,  and  a  simple  data  acquisition  and  analysis  program.  This  work 
should  continue  as  additional  software  modules  are  identified.  One 
module  that  should  be  developed  is  a  suite  of  diagnostic  programs  so 
that  a  Transputer  processing  element  can  be  tested.  The  test  module 
would  expedite  maintaining  the  CADDMA'^  system  when  it  fails.  It 
would  also  assist  in  the  dynamic  assignment  of  tasks  to  Transputer 
elements  if  a  processor  element  fails  during  an  engine  test. 

References 

[1]  T.  Bapty,  B.  Abbott,  and  J.  Sztipanovits,  "Real-Time  Turbine  Engine  Data 
Visualization,"  in  Transputer  Research  and  Applications  4.  D.  L  Fielding, 
Ed,  Amsterdam:  lOS  Press,  1990,  pp.  205-214. 

[2]  Anon..  The  Transputer  Databook.  2nd  ed.  Bristol,  UK:  INMOS,  1989. 

[3]  Anon.,  ZR34325  32-Bit  Floating-Point  Vector  Signal  Processor 
Engineering  Data.  Santa  Clara,  CA:  Zoran  Corp.,  May  1990. 

[4]  C.  S.  Burrus  and  T.  W.  Parks,  DFT/FFT  and  Convolution  Algorithms.  Theory 
and  Implementation.  New  York:  John  Wiley  &  Sons,  1985. 


27-18 


[5]  R.  W.  Ramirez,  The  ^>T.  Fundamentals  and  Concepts.  Englewood  Cliffs, 
NJ:  Prentice-Hall,  1985. 

[61  P.  M.  Morse  and  H.  Feshbach,  Methods  of  Theoretical  Physics.  New  York, 
NY:  McGraw-Hill,  1953. 

[7]  D.  L.  Smith,  "Common  problems  using  the  DFT  to  analyze  mechanical 
vibrations,"  in  preparation. 

[8]  D.  L  Smith,  "The  attraction  of  building  parallel  computer  systems  from 
inexpensive  processing  element,  and  the  problems,"  in  preparation. 

[9]  J.  R.  Allen  and  K.  Kennedy,  "PFC:  a  program  to  convert  fortran  to 
parallel  form,"  in  Supercomputers:  Design  an  Applications.  K.  Hwang 
Ed.  Los  Angeles,  CA:  IEEE  Computer  Society  Press,  1984,  pp.  186-203. 
Reprinted  from  The  Proc.  of  the  IBM  Conf.  on  Parallel  Comput.  and  Sci. 
Computations.  Rome,  Italy:  IBM,  1982. 

[10]  Anon..  The  Transputer  White  Pages,  Software/Consultants  Directory.  3rd 
ed.  Bristol,  UK:  INMOS  Ltd.,  Jan.  1990. 

[11]  B.  Abbott,  personal  communication,  July  1991. 


27-19 


[12]  G.  Booch,  Software  Engineering  with  Ada.  2ncl  ed.  Menlo  Park,  CA: 


Benjamin/Cummings,  1987. 

[13]  N.  Wirth.  Programming  in  Modula-2.  2nd  ed.  Berlin:  Springer-Verlag, 
1983. 

[14]  J.  G.  P.  Barnes,  "Ada  on  transputer  arrays,"  Applications  of  Transputers 
1,  L.  Freeman  and  C.  Phillips,  Eds.  Amsterdam:  lOS  Press,  1990,  pp.  1-9. 

[15]  P.  Wallich,  "FORTRAN  forever,  is  it  still  the  language  of  choice  for 
science?."  Scientific  American,  vol.  265,  p.  112,  June  1991. 

[16]  B.  Kernighan  and  D.  Ritchie,  The  C  Programming  Language.  2nd  ed. 
Englewood  Cliffs,  NJ:  Prentice-Hall,  1988. 

[17]  Anon.,  Helios  Developer's  Notes.  Somerset,  U.K.:  Perihelion  Software 
Ltd. 

[18]  D.  Pountain,  "A  personal  transputer,"  Bvte.  vol.  13,  pp.  303-308,  June 
1988. 


27-20 


WAKE  AND  PROJECTILE  VELOCITY  ESTIMATION 


D.  Mitchell  Wilkes  &  Georges  Badih  Aboutanos 
Vanderbilt  University 
Department  of  Electrical  Engineering 
Nashville,  TN  37235 


1  Abstract 

Digital  signal  processing  techniques  were  used  to  estimate  the  velocity  of  a  projectile 
and  its  wake.  The  ob..8rved  data  was  collected  by  one  or  multiple  doppler  radars  in 
an  imdergroimd  ballistic  range. 

A  classical  FFT-based  spectral  estimation  approach  was  used  to  identify  the 
frequency  characteristics  of  the  data.  These  frequencies  were  later  translated  into 
velocities  leading  to  accurate  velocity  profiles  of  the  projectile  and  of  the  wake. 

A  C  program  was  also  developed  to  implement  the  spectral  estimation  approach 
and  to  provide  an  automated  analysis  of  the  data  wth  extensive  graphical  display 
of  the  results.  These  graphical  presentations  include  velocity  profiles,  3-D  surface 
plots,  and  contour  plots. 

2  Introduction 

The  main  goal  of  this  research  v/as  to  develop  an  automated  analyds  technique 
for  doppler  radar  data  from  a  baUistic  range.  The  doppler  data,  is  collected  on 
a  projectile  followed  by  its  wake,  then  it  is  automatically  analysed  to  provide  a 
graphical  and  numerical  description  of  the  velocity  behavior  of  the  projectile  as 
well  as  the  wake,  which  is  more  difficult  to  analyze. 
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Projectile 


Radar  1 
Frequency  1 


Figure  1:  Projectile  velocity  measurement. 

2.1  Physical  Description  of  the  Range 

The  measurements  are  tahen  in  a  long  imderground  range,  usually  evacuated  to  a 
low  pressure.  A  typical  projectile  is  about  10  cm  long  and  2  cm  in  diameter  is  fired 
down  range  by  a  two-stage  compressed  light-gas  gim  at  a  speed  of  approximately 
2Q,Q00  ft/s.  As  the  projectile  travels  down  the  range  a  few  dopplcr  radars,  usually 
three,  each  at  a  difierent  frequency  (8,6,  17  and  35  GHz),  measure  the  velocity 
of  the  projectile  and  the  wake  following  the  projectile.  These  radar  antennas  are 
usually  located  about  45'’  from  the  line  of  path  of  the  projectile.  (See  Figure  1) 

2.2  Description  of  the  Measurement  System 

This  section  describes  the  radar-  measurement  system  and  the  way  the  doppler 
frequency  is  measured  and  converted  to  velocity.  The  radar  is  transmitting  at 
some  frequency  fr.  This  is  a  quasi-monostatic  radar  system  where  the  transmitter 
antenna  and  the  receiver  antenna  are  co-located.  When  the  radar  v/ave  hits  the 
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Receiver  Radar  Antenna 


Figure  2:  Radar  measxirement  system. 

moving  projectile  or  its  wake  it  reflects  back  to  the  radar  receiver  at  the  frequency 
fr  shifted  up  by  a  floppier  frequency  f^. 

The  received  frequency  fr  +  fd  is  down  converted  to  an  intermediate  frequency 
fiF  +  fd’  A  Log  amplifier  is  used  to  collect  eunplitude  data,  and  the  limited  IF  data 
is  separated  into  an  In-phase  channel  and  a  Quadratme  channel  (which  is  phase 
shifted  by  90®  relative  to  the  In-phase  channel).  The  data  is  then  downconverted 
to  DC  by  multiplying  it  again  with  quadrature  sinusoids  at  fjp,  and  finally  each 
channel  is  passed  through  a  low  pass  filter  leading  to  a  floppier  frequency  related  to 
the  speed  of  the  projectile  and  its  wake  at  the  In-phase  and  Quadrature  channels 
(See  Figures  2  and  3).  LPF  is  the  low  pass  filter,  fjp  is  the  intermediate  frequency, 
fr  is  the  radar  frequency  and  fd  is  the  floppier  frequency. 

The  floppier  frequency  is  converted  to  velocity  using  the  following  equations  [1]: 

r  2t;cos(0)/, 

fd  = - —  (1) 
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2  COs{6)fr 

where  v  is  the  velocity  of  the  target,  6  is  the  angle  between  the  line  of  the  path 
of  the  projectile  and  the  radar  line  of  sight  (uaually  45°),  c  is  the  speed  of  light, 
3  *  10®m/s,  fr  is  the  radar  frequency,  and  /j  is  the  doppler  frequency. 

Substituting  fd  into  Equation  (2)  will  provide  the  velocity  of  the  projectile  or  of 
the  wake,  v.  The  velocity  and  the  doppler  frequency  are  proportional,  which  means 
the  larger  the  doppler  shift  the  faster  the  projectile  is  moving  and  the  smaller  the 
doppler  shift  the  slower  the  projectile  is  moving. 

2.3  Description  of  the  Projectile  and  the  Wake  Results 

The  raw  data  is  nominally  taken  at  a  sampling  rate  of  200  ns  per  point.  This 
data  typically  consists  of  fom  sections.  The  first  section  is  random  noise  at  low 
amplitude,  the  second  section  is  the  return  from  the  projectile,  which  is  the  first 
object  of  interest  seen  by  the  radar.  This  section  is  characterized  by  a  large  constant 
magnitude  and  a  high  doppler  frequency.  The  third  section  is  the  return  off  the  wake 
and  it  has  basically  the  same  constant  magnitude  behavior  as  the  projectile,  but 
the  frequency  is  lower  because  the  wake  moves  slower  than  the  projectile.  Finally 
the  fourth  section  of  the  data  is  random  noise  again  at  low  amplitude.  An  example 
of  such  data  for  6000  points  and  a  200  ns  sampling  period  is  shown  in  Figure  4. 

The  projectile  and  the  wake  can  be  distinguished  easily  from  the  backgroxmd 
noise  because  they  have  a  higher  amplitude.  After  converting  the  raw  data  to 
velocity  we  observe  a  null  area  immediately  behind  the  projectile  that  is  due  to 
the  recombination  of  gases.  This  feature  can  be  used  to  separate  the  wake  from 
the  projectile  so  that  the  velocity  behavior  of  each  (i.e.,  projectile  and  wake)  can 
be  observed  separately.  The  final  result  will  be  displayed  in  terms  of  percentage 
of  the  model  speed  versus  distance  from  the  begining  of  the  projectile  measured  in 
projectile  body  diameters.  A  sample  of  such  a  result  is  shown  in  Figure  5. 
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Raw  doppler  data  amplitude 


PROJECTILE+WAKE  (17.0  GHz) 
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Figure  5:  Extracted  projectile  and  wake  velocity  profile. 
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3  Problem  Discussion 


3.1  Statement  of  the  Problem 

Provided  with  some  doppler  data  collected  on  a  projectile  followed  by  its  wake, 
our  goal  was  to  use  digital  signal  processing  techniques  to  estimate  the  velocity  of 
the  projectile  and  the  velocity  of  the  wake  for  possibly  multiple  radar  frequencies 
such  as  8.6,  17  and  35  GHz.  The  analysis  of  the  data  and  the  estimation  technique 
should  be  fully  automated  providing  a  projectile  and  wake  velocity  profile,  and  a 
wake  velocity  profile  alone.  New  visualization  techniques  were  also  proposed  to 
provide  a  clearer  understanding  of  the  projectile  and  wake  velocity  behaviors. 

3.2  B£isic  Approach 

The  basic  approach  of  this  effort  was  as  follows.  The  doppler  data  is  broken  into 
many  possibly  overlapping  windows.  Spectral  estimation  is  performed  separately  on 
each  window.  We  find  the  location  of  the  tallest  spectral  peak  in  each  window  and 
use  it  as  our  estimate  of  the  doppler  frequency  for  that  time  window.  The  peaks 
from  each  time  window  are  collected  and  represent  the  doppler  frequency  behavior 
as  a  fimction  of  time.  These  doppler  frequencies  are  then  translated  directly  into 
velocities  via  Equation  (2),  and  displayed  in  different  graphical  formats  describing 
the  velocity  behavior  of  the  data.  Analysis  is  performed  to  extract  the  velocity 
profile  of  the  projectile  and  the  wake  and  to  distinguish  them  from  the  background 
noise.  This  basic  approach  will  now  be  described  in  greater  detail. 

3.3  Details  of  Spectral  Estimation 

Spectral  estimation  is  used  to  identify  the  frequency  characteristics  of  the  data, 
which  can  be  translated  directly  into  velocity.  A  classical  FFT-based  spectral  esti¬ 
mation  approach  was  chosen  due  to  its  well  behaved  characteristics  [2].  Although 
this  approach  may  have  difficulty  resolving  closely  spaced  sinusoids,  this  limitation 
does  not  cause  a  problem  for  a  basic  analysis  of  the  data. 

The  first  task  is  to  break  the  observed  sequence,  x(n),  into  many  possibly  over- 
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lapping  L  point  long  windows,  Xm{n)^  given  by 

x{mM  +  n)  n  =  0,...,L  —  l 
0  else 

where  M  is  the  distance  in  samples  between  the  begimiings  of  successive  windows 
and  mM  is  the  location  of  the  beginning  of  the  m**  window.  Before  applying  FFT- 
based  spectral  estimation  on  each  window,  we  applied  window  functions  to  reduce 
the  ripples  in  the  estimated  spectrum  due  to  sidelobe  effects  from  using  finite  length 
data  windows.  Some  of  the  windows  that  were  iised  include  the  Hamming,  Hanning, 
Blackman,  and  Rectangular  windows  [2].  These  windows  also  provide  a  degree  of 
spectral  smoothing  that  tends  to  suppress  false  spectral  peaks.  Each  L  point  long 
window  is  multiplied  by  the  chosen  L  point  long  window  function  w{n)  to  form 

!/m(«)  =  Xm{n)w{n)  (4) 

The  squared  magnitude  of  the  FFT  of  t/m(«)  Is  used  as  the  estimate  of  the  power 
spectrum  of  the  window,  given  by 

Sm{k)  =  in,(fc)P  =  E  (5) 

n=0 

N  is  the  length  of  the  FFT  and  should  be  a  power  of  2  in  order  to  be  able  to  use 
the  common  radix-2  fast  algorithms  [2].  We  also  require  N  >  L,  and  H  N  >  L, 
zeros  should  be  appended  to  ym(n)  until  its  length  is  N.  The  frequency  bin  number 
k  corresponds  to  a  discrete-time  frequency  of  ^k. 

We  further  form  a  spectral  matrix  with  columns  consisting  of  the  successive 
Sm{kys  computed  from  the  time  windows.  The  major  advantage  here  is  that  this 
allows  us  to  observe  all  the  frequencies  in  the  data  and  not  just  the  peak  doppler 
frequencies.  This  provides  a  good  3-D  view  of  the  evolution  of  the  doppler  frequen¬ 
cies  (i.e.,  velocities)  over  time  (see  Figures  6  and  7).  False  peaks  are  also  suppressed 
by  averaging  the  columns  in  the  matrix  three  at  a  time  to  achieve  some  degree  of 
temporal  smoothing  in  the  frequency  domain. 
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Figxire  6:  3-D  plot  of  the  evolution  of  doppler  frequency  with  time. 
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Figure  7:  Contour  plot  of  the  evolution  of  doppler  frequency  with  time. 
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3.4  Automatic  Extraction  of  Projectile  and  Wake 

In  section  2.4  we  disc  .,sed  the  characteristics  of  the  wake  and  the  projectile,  hi 
this  section  we  show  how  these  characteristics  are  used  to  separate  the  projectile 
and  the  weike  horn  the  noise  and  how  the  projectile  and  the  wake  are  automatically 
separately  from  each  other. 

When  we  observe  the  raw  data  w'e  notice  that  the  projectile  and  the  wake  have  a 
higher  and  more  constemt  amplitude  than  the  random  noise  in  the  be^nning^ld  at 
the  end  of  the  data.  Therefore,  we  locate  the  maximum  point,  in  the  spectral 
matrix  (Figure  6)  and  assume  it  is  located  on  the  projectile  or  the  wake  (this  has 
always  been  observed  to  be  the  case).  Then,  we  start  comparing  the  maximum  peak 
amphtudes  of  the  Smik)  power  spectrum  estimate  from  each  window  to  a  threshold 
(for  example  Smaxl^  provided  good  results).  The  location  where  the  values  cross 
this  threshold  determines  the  be^nning  of  the  projectile.  All  the  data  before  that 
point  is  considered  to  be  noise. 

A  similar  approach  is  used  to  lo^?/e  the  end  of  the  wake.  However,  '  not 
compare  each  peak  of  the  Sm{k)  individually,  but  rather  we  create  a  test  window 
and  compare  the  maximum  value  of  thi-.  mdow  to  a  threshold  (for  example  Smax!^ 
provides  good  results  ).  Such  a  window  f  ppru^ch  is  necessary  due  to  the  nature  of 
the  wake  data  which  may  drop  below  the  threshold  for  a  short  time.  A  window  wiU 
detect  wether  the  data  is  below  the  threshold  for  a  longer  period  of  time.  Using  the 
two  points  identified  in  this  way  we  extract  the  data  from  between  and  consider  it 
to  be  the  data  for  the  projectile  plus  the  wake  (See  Figure  8). 

As  was  mentioned  earlier,  after  we  convert  the  raw  data  to  velocity  we  observe 
a  null  area  immediately  behind  the  projectile.  This  characteristic  plus  the  fact  that 
the  wake  should  be  traveling  slower  than  the  projectile  provide  a  way  to  extract  the 
wake  data  using  a  frequency  threshold  rather  than  an  amplitude  threshold.  The 
thres*  old  used  here  or  when  the  data  is  being  separated  from  the  noise  is  very 
dependent  on  the  natme  of  the  data  (see  Figure  9,  the  extraction  of  the  wake). 
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Figure  8:  Extracted  projectile  plus  wake  velocity. 
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Figure  9:  Extracted  wake  velocity. 
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4  Results 


We  developed  a  C  program  to  implement  this  spectral  estimation  approach.  This 
program  was  designed  to  be  user-friendly,  and  provides  an  automatic  and  complete 
graphical  analysis  of  the  velocity  behavior. 

The  user  is  prompted  for  a  few  inputs  .*ch  as  the  name  of  the  file  containing  the 
measured  data,  the  projectile  body  diameter,  the  sampling  period,  the  smoothing 
window  preferred,  the  radar  position  and  frequency  and  some  other  necessary  pa¬ 
rameters.  The  C  program  will  then  automatically  extract  the  projectile  and  wake 
from  the  data  and  provide  the  user  with  detailed  graphical  presentations  of  the 
velocity  behavior  of  the  projectile  and  the  wake.  Two  of  these  representations  are 
2-dimentional  plots  of  velocity  versus  time.  As  mentioned  earlier  the  velocities  are 
normalized  to  the  maodmiun  velocity  of  the  projectile  and  the  time  is  converted  to 
distance  in  terms  of  projectile  body  diameter.  The  other  two  graphical  represen¬ 
tations  consist  of  the  3-dimensional  plot  of  velocity,  time,  and  intensity,  and  the 
contour  plot.  An  example  of  these  fom  plots  is  given  in  Figure  10. 

These  graphical  presentations  provide  the  user  with  revealing  views  of  the  data 
behavior.  The  program  is  also  capable  of  comparing  the  results  of  a  test  done  using 
three  radars  each  with  a  different  frequency.  The  output  will  be  a  set  of  eight 
plots,  fom:  for  the  projectile  plus  weike  and  four  for  the  wake  only.  This  capabilitiy 
allows  the  comparison  of  velocity  behavior  for  different  frequencies  when  performed 
on  the  same  test.  This  technique  can  also  be  used  to  compare  the  data  for  the 
same  frequency  but  from  the  In-phase,  Quadrature  and  complex  (In-phase  and 
Quadrature  data  combined)  outputs  (See  Figure  11). 


5  Conclusion 

The  proposed  approach  can  be  used  for  automated  analysis  of  doppler  radar  data. 
As  mentioned  in  section  4,  a  relatively  small  number  of  initial  parameters  are  needed 
to  provide  an  automated  graphical  analysis  of  the  velocity  of  the  projectile  aind  its 
wake.  The  proposed  means  of  visualizing  the  results  through  3-D  surface  plots 
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Figure  10:  Graphical  presentation  of  results. 
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Figiu-e  11;  Spectral  estimation  comparison  for  3  sots  of  data. 
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and  contour  plots  provide  a  more  complete  understanding  of  the  data.  Future  work 
may  include  the  use  of  higher  resolution  and/or  nonstationary  spectral  analysis  tech¬ 
niques  such  as  auto-regressive  modehng  and  time-frequency  distribution  techniques 
such  as  the  Wigner-Ville  transform  that  hold  promise  of  increasing  the  accuracy  of 
the  frequency  estimates. 

Other  applications  should  be  studied  such  as  in-barrel  doppler  data  or  3-D  imag¬ 
ing  of  the  wake  using  multiple  radars.  The  latter  application  is  a  Phase  Interfer¬ 
ometer  that  may  be  used  to  characterize  qualities  of  the  wake  such  as  its  diameter 
and  the  density  of  free  electrons. 
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Abstract: 


New  control  techniques  are  needed  to  keep  pace  with  the  development 
of  technology  in  altitude  test  cells.  A  recently  proposed  technique  for  these 
controls  is  a  model-following  controller  using  an  inverse-process  model,  but 
there  may  be  room  for  improvement  in  the  accuracy  of  the  inverse-process 
model.  During  this  effort,  an  extended  Kalman  filter  was  developed  to  reduce 
errors  in  the  inverse-process  model. 

Introduction: 

As  simulated  altitude  tests  are  required  to  become  more  necessary  and 
economically  desirable,  the  systems  required  to  control  these  test  cells  are  also 
having  to  become  more  accurate. [1]  The  technology  and  techniques  now 
available  allow  more  realistic  modelling  and  estimation  of  real-world 
conditions  in  test  cells.  Of  special  interest  is  the  performance  testing  of 
engines  under  transient  conditions.  With  costs  of  flight  testing  becoming 
much  more  expensive,  economical  alternatives,  especially  ground  testing, 
become  very  important  considerations. 

The  most  common  controller  presently  is  a  PID  controller,  sometimes 
gain  scheduling  is  added  to  the  PID  controller  to  increase  its  flexibility  [1].  A 
recent  paper  by  Chaney  [1]  proposed  an  improved  technique  for  controlling 
tests  in  these  ceils.  The  paper  found  a  model  following  controller  to  be  the 
most  flexible  and  accurate  form  of  control  in  chis  situation.  The  control 
consisted  of  a  second  order  linear  model  generating  smooth  trajectories  for 
each  setpoint,  using  measurements  or  estimates  of  process  states  and 
parameters  in  an  inverse  process  model  to  establish  the  control  inputs  to  the 
plant.  The  controller  compensates  for  nonlinear  plant  dynamics  by 
linearizing  and  decoupling  the  system.  Perfect  modelling  is  the  goal,  but 
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tracking  errors  in  the  system  develop  from  unmodelled  dynamics  or 
parameter  errors  in  the  feedforward  controller.  [1] 

One  source  of  error  results  from  errors  in  the  observation  and 
estimation  of  the  necessary  parameters  for  the  inverse  process  model.  These 
errors  con  be  reduced  by  employing  an  extended  Kalman  filter  observer  to 
estimate  these  parameters  from  the  observed  data.  Such  a  Kalman  filter  was 
developed  during  this  effort. 

The  next  section  provides  a  general  discussion  of  the  problem  at  hand. 
The  sections  following  provide  a  general  procedure  for  applying  an  extended 
Kalman  filter  to  this  problem  and  a  specific  procedure  for  developing  the 
Kalman  filter  is  given.  A  results  section  then  shows  the  performance  of  the 
Kalman  filter  against  simulated  data. 

Problem  Discussion: 

Our  task  was  to  design  an  extended  Kalman  filter  to  observe  the 
important  parameters  of  a  altitude  test  cell.  The  Kalman  filter  observer  will 
become  part  of  a  controller  for  the  test  cell.  [1] 


Figure  1  -  Model  Following  Controller  with  Kalman  Filter 
The  controller  shown  in  Figure  1  is  a  model-following  controller  that 
includes  a  feedforward  inverse-process  model  to  translate  the  setpoint  into 
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the  proper  inputs  to  the  test  cell.  The  Kalman  filter  is  necessary  to  provide 
high  quality  estimates  of  the  important  parameters  so  that  the  feedforward 
inverse-process  model  will  perform  accurately.  The  test  cell  under 
consideration  is  shown  in  Figure  2. 


Figure  2:  Scl -Jinatic  of  Test  Cell 
The  important  parameters  of  .fie  test  cell  are: 


Pl(t) 

pressure  in  the  inlet  pies  'm 

Tl(t) 

temperature  in  the  inlei  plenum 

Vi  i(t) 

volume  flow  rate  into  the  test  article 

heat  flow  rate  out  of  the  ‘nlet  plenum 

Tw(t) 

temperature  of  the  inlet  p'enum  wall 

Vc(t) 

volume  flow  rate  of  cold  au 

Vh(t) 

volume  flow  rate  of  hot  air 

Tc 

temperature  of  cold  air  supply  (assumed 
constant) 

Th 

temperature  of  hot  air  supply  (assumed 
constant) 

Accd 

valve  area  ratio  for  cold  air 

Ahcd 

valve  area  ratio  for  hot  air 

AP 

difference  between  total  and  static 
bellmouth  pressures 

Pc 

pressure  of  cold  air  supply  (assumed  constant) 

Ph 

pressure  of  hot  air  supply  (assumed  constant) 

Table  1  -  Parameters 


The  nonlinear  differential  equations  governing  the  system  are: 
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Pl(t)  = 
fl(t)  = 


^^^Vh(t)  +  Vc(t)-Vn(t)] 


Tl(t) 


VI 


Vh(t)  V 


Tl(t) 
'  Th 


+  Vc(t) 


(Y  -  1)  + 


Q(t) 

Pl(t) 


(Y 


Pl(t) 

Tl(t)l-76 


As  (T\v(t)  -  Tl(t)) 


Pl(t)- AP  vfl(t)  V  R(y-  1) 


Pl(t)  -  AP 

Pt(t)  1 


|pi(t)  -  APj 
\  Pl(t)  ) 


Tvv(t)  = 


Q(t) 

3600  Cm  M 


Vh(t)  = 


R  Th  Ph  Ahcd 
PUO 


(t._RTc  PcAccd  ,  r~2 
Pl(t)  fr;  V  R  (Y  -  1) 


where  the  meanings  of  the  variables  are  given  in  the  following  table: 


Ch 

Acd 

b 

As 

Cm 

R 

M 


ratio  of  specific  heats _ 

collection  of  constants _ 

bell-mouth  throat  area _ 

empirical  constant _ 

inlet  plenum  effective  surface  area 
heat  capacitance  of  the  metal 
gas  constant _ 


mass  of  the  metal _ 

Table  2  -  Constants 


Procedure: 

Two  different  formulations  of  these  differential  equations  were  tried  in 
order  to  strike  a  good  balance  of  accuracy,  utility,  and  computational 
complexity.  Both  formulations  required  extensive  symbolic  differentiation  in 
order  to  derive  and  linearize  the  system  model  before  implementing  the 
Kalman  filter.  This  symbolic  differentiation  was  too  extensive  to  handle  by 
hand,  therefore  a  symbolic  mathematics  package,  Mathematica,  was  employed 
to  compute  the  derivatives.  Each  formulation,  however,  followed  the  same 
basic  procedure.  [2]  That  basic  procedure  can  be  described  as  follows: 

1.  From  the  important  parameters  of  the  system  model  equations, 
select  a  set  of  these  parameters  (and  their  time  derivatives)  to  be  the 
state  variables  of  the  system.  Select  the  remaining  important 
parameters  to  be  inputs.  This  results  in  the  nonlinear  state 
equations 

x(t)  =  f(x(t).  u(t)) 

where  x(t)  is  the  vector  of  state  variables  and  u(t)  is  the  vector  of 
input  variables.  Also  the  vector  of  observations  became 
z(t)  =  HTx(t) 

where  the  transpose  of  H  selects  the  observable  state  variables. 

2.  The  equations  for  the  time  derivatives  of  the  state  variables  define 

nonlinear  differential  equations  describing  the  dynamics  of  the 

system.  Discretize  these  equations  using  the  Euler  approximation  to 

the  derivative 

i(t)«;^[x(t+Ts)-x(t)] 

^  S 

This  results  in  a  set  of  nonlinear  difference  equations  of  the  form 
x((k+l)Ts)  =  x(k  Ts)  +  Ts  f(x(k  Ts),  u(k  Ts)) 
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3.  These  nonlinear  difference  equations  are  then  linearized  about  the 
state  variables  and  the  input  variables  to  obtain  a  set  of  linear  time- 
varying  difference  equations.  This  will  be  described  in  more  detail 
later  in  this  report. 

4.  The  linearized  difference  equations  are  used  to  produce  the 
extended  Kalman  filter. 

The  difference  in  the  formulations  derives  from  choosing  different  sets  of 
state  variable  *  step  1.  above. 


Results: 

For  the  first  realization,  the  state  vector  x  was  selected  as 

■Pl(t)' 

Tl(t) 

0(0 

x(0=  V,i(0 

Tw(t) 

Vh(0 

.Vc(0. 

and  the  input  vector  u  was  selected  as 

■  Ph(o  ■ 

Pc(0 
Th(0 
u(0=  Tc(0 
Ahcd(0 
Accd(0 
.  AP(t)  . 

This  formulation  required  that  time  derivatives  of  some  of  the  original 
nonlinear  equations  be  taken.  In  particular  we  took  time  derivatives  of  the 
equations  for  Q(t),  Vj  i(t),  Tvv(t),  Vh(t)  and  Vc(t)  to  obtain  equations  for  0(0,  Vi  i(t), 
Tw(t),  Vh(0,  and  Vc(0.  These  derivatives  resulted  in  very  large  and  complex 
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nonlinear  differential  equations.  The  resultant  vector  of  nonlinear 
differential  equations,  f,  is  given  by 


f(x(t),  u(t))  = 


’PKO" 
fl(t)  1 
«t)  I 
Vi,(t)  I 
Tw(t)  I 


Vh(t)  I 

_Vc(t)J 


These  nonlinear  equations  had  to  be  discretized  and  then  linearized  to  form  a 
set  of  linear  state  equations  in  order  to  apply  the  Kalman  filter  equations.  [?] 
The  discretization  follov/ed  the  approach  in  step  2  above.  After  dropping  the 
fs  notation  in  the  arguments  of  x(k  Tg)  and  u(k  Tg)  we  obtain 
x(k+l)  =  x(k)  +  Ts  f(x(k).  u(k)) 

The  linearization  was  carried  out  by  t'>i'*ng  derivatives  with  respect  to  x  to 
form  a  state  transition  matrix  F  and  with  respect  to  u  to  form  the  input 
coupling  matrix  B.  The  linearized  state  equations  were  thus  given  by 
x(k+l)  =  F(k)  x(k)  +  B^k)  u(k) 


where 


F(k)  = 

— ^x(k)  +  Ts  hx(k),  u(k))] 

0x(k) 

B(k)  = 

,x(k)  +  Ts  L(x(k),  u(k))] 

3u(k) 

The  measurement  equation  was  given  by 


z(k)  =  H'*'  x(k)  +  Y(k) 


where 


H  = 


'"1  0  0' 
0  1  0 
000 
000 
00  1 
000 
0  0  0. 
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and  v^(k)  represents  uncertainties  (i.e.  noise  and/or  error)  in  the 


measurements. 

The  Kalman  filter  variables,  summarized  in  the  table  below,  were 
given  initial  values  so  that  the  filter  could  run. 


Sp 

predictive  noise  covariance  matrix 

Sc 

corrective  noise  covariance  matrix 

H 

output  coupling  matrix 

Rn 

output  noise  covariance  matrix 

Qns 

input  noise  covariance  matrix 

z 

readings  from  actual  test  cell 

Xp 

predictive  state  vector 

Xc 

corrective  state  vector 

_ 3 

Table  3  -  Kalman  Filter  Variables 


The  Kalman  filter  operated  according  to  the  following  equations.  The 
Kalman  filter  first  calculated  the  Kalman  gain  L  [2] 

Uk)  =  Sp(k)  H  [HT  Sp(k)  H  +  R„(k)]'‘ 

The  corrected  noise  covariance  matrix  Sc  was  then  calculated  according  to  [2] 

Sc(k)  =  Sp(k)  -  Uk)  H^SpCk) 

The  vector  of  measurements  z  and  the  input  vector  u  were  formed  by  reading 
the  following  values  from  the  data  files  (of  course,  in  a  real-time 
environment  these  would  come  from  the  sensors  directly). 


z  = 


Plsim(k)l 
Tlsim(k)  1 
Twsim(k)j 


u  = 


I  I 

I  Th 

!  Te 

I  Ah(k) 

i  Ac(k)  i 

LAP{k)  I 


The  pressures  and  temperatures  of  the  supply  sources  Ph,  Pc/  Th,  and  Tc  are 
assumed  to  be  constant. 

The  corrected  state  estimate  Xc(k)  (corrected  using  the  measurements)  was 
given  by 

Xc{k)  =  Xp(k)  +  L(k)  [z(k)  -  Xp(k)J 

The  values  from  ^  were  used  to  evaluc  id  B  at  time  k  since  F  and  B  are 
functions  of  x(k)  and  u(k)-  F  and  B  were  then  used  to  calculate  the  one-step 
predicted  noise  covariance  matrix  Sp(k). 

Sp(k)  =  F(k)  Se(k)  FT(k)  +  B(k)  Qns  B'^Ck) 

The  prediction  of  the  next  value  of  eacn  of  the  state  variables  was  made  from 
the  previously  derived  discrete  nonlinear  equations. 

'  Plm(k+1)~ 

TIm(k+l) 

Qm(k+1) 

Xp(k+1)  =  Vi  ,m(kfl)  =  Xc(k)  +  Tj  f(Xc(k),  u(k)) 

Twm(k+1) 

Vhm(k+1) 

_  Vcm(k+1)  _ 

This  process  repeated  over  all  the  points  in  the  simulation  data. 

The  model  from  above  was  used  with  the  derivatives  of  the  equations 
for  five  of  the  state  variables.  The  filter  worked  very  well  when  simulated  in 
the  Mathematica  package,  but  the  equations  for  the  F  and  B  matrices  were 
completely  intractable  for  coding  into  FORTRAN  (the  equations  describing 
the  elements  of  the  F  and  B  matrices  required  160  pages  to  print  out), 
therefore  a  simpler  model  was  developed.  Instead  of  using  Q,  V]  i,  Tvv,  Vj,  and 
Vg  as  the  slate  variables,  their  integrals  Q,  Vn,  Tw,  Vh,  and  Vc  were  used. 
This  simplified  the  equations  greatly,  since  the  need  for  additional 
differentiation  (to  obtain  Q(i),  Vii(i),  tvv(t),  Vh(t),  and  Vc(t)  in  the  previous 
formulation)  was  eliminatea.  The  model  worked  well  in  this  simplified 
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form,  but  had  problems  tracking  the  wall  temperature  Tw.  This  was 
apparently  due  to  the  fact  that  the  diagonal  element  of  Sp  that  allows 
correction  from  the  sample  of  Tw  was  becoming  very  tiny,  causing  very  little 
of  the  correction  to  affect  Tw.  When  the  engine  began  a  maneuver,  Tw 
would  track  poorly  because  the  Kalman  gain  for  Tw  was  too  tiny  to  produce 
much  correction.  We  solved  the  problem  by  adding  a  value  of  0.5  to  this 
element  of  Sp  at  every  iteration.  This  resulted  in  a  higher  Kalman  gain  for 
Tw  thus  allowing  the  sample  of  Tw  to  provide  more  correction  to  the  value 
of  the  predicted  Tw  in  2^(k).  See  the  graphs  provided  in  the  appendix. 

Conclusion: 

The  developed  extended  Kalman  filter  tracked  very  well,  but  several 
topics  need  to  be  addressed.  Future  research  should  include  the  following 
topics: 

1.  Integration  of  the  FORTRAN  implementation  of  the  extended 
Kalman  filter  into  existing  simulation  and  control  software  to  test 
performance  more  rigorously  and  evaluate  whether  the  accuracy 
level  is  acceptable. 

2.  Evaluate  whether  the  present  implementation  is  fast  enough  for  a 
real-time  control  situation. 

3.  Investigate  faster  implementations  using  multiprocessors  and/or 
high  speed  processors. 

4.  Investigate  the  stability  and  sensitivity  of  the  extended  Kalman 
filter  to  disturbances  and  modelling  error. 

The  results  of  this  effort  were  very  encouraging  and  strongly  suggest  that  the 
use  of  an  extended  Kalman  filter  can  improve  the  accuracy  of  the  inverse- 
process  model  in  the  model-following  controller. 
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The  graphs  below  show  the  actual  data  from  PI,  Tl,  and  Tw  plotted 
with  the  predicted  values  of  PI,  Tl,  and  Tw.  The  plots  for  Q,  V n,  Vh,  and  Vc 
show  the  behavior  expected  of  the  integral  of  the  quantities  of  interest. 
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Tw  and  Twsmi 
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ABSTRACT 

New,  non-protic  N-nitration  methods  were  explored  using  nitronium  triflate, 
NO2OSO2CF3  (N020Tf),  generated  in  situ  from  tetrabutylammonium  nitrate,  BU4NNO3, 
and  trifluoromethanesulfonic  anhydride,  (CF3S02)20  (Tf20).  2-Pyrrolidinone, 
pyrrolidine,  N-acetyl-pyrrolidine,  N-acetyl-piperidine,  N-acetyl-4-piperidinone, 
imidazolidinone,  succinimide,  and  hydantoin  were  nitrated  using  these  methods.  Aqueous 
and  non-aqueous  workups  gave  yields  ranging  from  20-76%.  The  reaction  methods  are 
significantly  improved  and  eliminate  the  hazard  potential,  catalytic  and  solubilizing 
necessity  and  reaction  workup  requirements  of  nitromethane  as  a  reaction  solvent 
previously  reported.  Investigations  were  performed  at  the  F.  J.,  Seiler  Research 
Laboratoiy,  AFSC  FJSRL/NC,  U.  S.  Air  Force  Academy,  Colorado  Springs,  Colorado. 


The  nitration  of  heterocyclic  compounds  is  of  great  importance  to  several  fields  of 
chemistry.  Of  primary  interest  to  the  armed  services  is  the  use  of  nitro  compounds  in 
explosives  and  propellants.^*^  The  formation  of  these  compounds  in  large  quantities  is  of 
prime  importance  for  their  production  however  m  the  laboratory  more  convenient  methods 
of  nitration  are  required  in  order  to  form  trial  compounds  for  study.  The  use  of  often  harsh 
conditions  exemplifies  their  general  bulk  syntheses  and  many  investigators  who  are 
interested  in  the  decomposition  and  stability  of  the  compounds  search  for  milder  synthetic 
conditions  for  smaller  quantities.  These  conditions  would  allow  modification  of  the  uial 
compounds  and  the  incorporation  of  more  sensitive  functionalities.  Thus,  there  are  several 
areas  to  be  addressed  in  this  study: 

1)  Investigate  new  alternative  methods  for  the  formation  of  nitronium  triflate, 
NC)20S02CF3,  (N020Tf),  from  tetrabutylammonium  nitrate,  BU4NNO3,  and 
trifluoromethanesulfonic  anhydride,  (CF3S02)20. 

2)  Investigate  non-protic  nitration  methods. 
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3)  Increase  the  reaction  yields  and  N020Tf  solubility  in  the  reactions. 

4)  Investigate  N-nitration  versus  C-nitration  possibilities. 

5)  Eliininate  the  catalytic  necessity  or  solubilizing  ability  of  nitromethane. 

6)  Eliminate  the  hazard  potential  and  reaction  workup  requirements  of 
nitromethane. 

7)  Determine  whether  a  base  is  required  for  intermediate  formation  in  the  nitration 
mechanism. 

8)  Retain  CH2CI2  as  a  reaction  solvent  for  other  reactions  and  then  substitute 
alkane  solvents  for  reactions  which  cannot  be  run  in  chlorinated  solvents. 


The  heterocyclic  compounds  (1  -  8)  and  their  corresponding  nitro  derivatives  (9  - 16)  used 
or  synthesized  in  this  study  are  given  below: 
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Nitronium  Source 

The  introduction  of  nitro  groups  into  organic  molecules  has  been  done  using  a 
variety  of  methods.^'^  Most  methods  require  strongly  acidic  conditions,  normally  using 
oxy-acids.  The  strongly  acidic  conditions  react  with  the  often  sensitive  functionalities, 
limiting  application  of  the  reactions  in  specific  compound  syntheses.  These  conditions  also 
induce  polymerization  making  the  method  not  applicable  to  polymer  precursors  for 
energetic  binders.  Recently,  the  use  of  trifluoroacetic  anhydride  and  ammonium  nitrate  in 
pure  nitromethane  as  a  solvent  has  been  reported  by  Suri  and  Chapman.^  These  conditions 
advanced  the  use  of  more  convenient  methodology  for  nitration  but  suffered  from  the 
requirement  of  nitromethane  as  a  solvent.  While  nitromethane  is  required  to  solubilize  the 
starting  reagents  and  the  subsequent  nitronium  salts.  Major  Scott  A.  Shackelford,  at  the  F. 
J.  Seiler  Research  Laboratory  at  the  L'l  Air  Force  Academy,  postulated  the  use  of 
tetrabutyl  ammonium  nitriate  as  the  nitrate  source  due  to  its  increased  solubility  in 
methylene  chloride  and  the  use  of  trifluoromethanesulfonic  anhydride  as  the  nitrate 
deoxygenating  agent  due  to  the  better  leaving  ability  of  triflate  anion. 

There  are  several  advantages  in  the  use  of  tetrabutylammonium  nitrate.  The 
quaternary  ammonium  salt  is  not  a  proton  source.  The  quaternary  ammonium  cation  can 
act  a  a  phase  transfer  agent  solubilizing  nitrate  and  remove  the  requirement  for  nitromethane 
in  known  nitration  reactions.  And,  the  workup  may  be  easier  as  the  ammonium  salt  can  be 
removed  by  chromatography. 

Heterocycle  Investigations 

'ilte  structures  of  the  heterocycles  to  be  used  in  this  study  vary  considerably  in  the 
chemical  availability  of  the  nitrogens  for  nitration.  On  examination  there  is  a  secondary 
amine  (2),  four  amides  (1, 3, 4, 5),  one  imide  (7),  one  imidazole  (6)  and  one  oxazole  (8). 
The  placement  and  affect  of  the  carbonyl  groups  can  contribute  to  the  availability  of  the  free 
electron  pair  on  the  nitrogen.  How  do  these  structure  variations  affect  the  reactivity  and 
yield  of  the  nitration  reactions?  Do  these  variations  show  up  in  different  isolation 
techniques?  Can  these  these  compounds  act  as  a  base  in  the  reactions,  i.e.  neutralize  the 
reaction  using  the  substrate,  and  does  the  addition  of  a  weak  base  assist  in  the  nitration 
reactions? 
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2>Pyrrolidinone  (1)  Investigations:  Formation  of  l-Nitro-pyrrolidinone  (9) 


Table  1.  Synthesis  Reactions  Producing  1-Nitro-PyrroIidinone  (9). 


1  9 


Entry 

Solvent  of 
Generation 

T  (’C) 

Salt  R  of 

1  R4NNO, 

Ratio 

Rgt/Subs 

Solvent  of 
Rxn 

'C,  time 

Yield 

Run 

1 

CHjClj 

•20 

H 

1:1 

CHjClj/NajSO, 

,  -20,  1  hr 

12 

lAl-1 

2 

•20,  1  hr 

13 

lBl-9 

3 

CHjCIj 

•  20 

H 

2:1 

CHjClj/NajSO 

4  -20,  1  hr 

16 

2A1.3 

4 

CHjNO, 

•  20 

H 

1:1 

CHjClj/NaiSOe  -20,  1  hr 

31 

1D1.32 

5 

CHiClj 

•  20 

Bu 

1:1 

CHjClj 

•20,  1  hr 

43 

6A1.20 

6 

CHjCli 

•20 

Bu 

1:1 

CHjClj/NajSO 

«  -20,  1  hr 

38 

8A1-25 

7 

CHjClj 

•20 

Bu 

2:1 

CHjClj 

•20,  1  hr 

41 

7A1-23 

8 

CHjNOj 

•20 

Bu 

1:1 

CHjClj 

•20,  1  hr 

45 

11A1.30 

9 

CHjClj 

0 

Bu 

1:1 

CHjCi, 

0.RT,  17  hr 

56.5 

16A1-42 

10 

CHjClj 

0 

Bu 

1:2 

CHjClj 

RT,  17  hr 

63(Tf)» 

20A2-12 

31.5(PyrV 

> 

11 

CHjCli 

0 

Bu 

1:1 

CHjClj 

RT,  1  hr 

25.3 

24A2-20 

12 

CHjClj 

0 

Bu 

1:1 

CHjClj 

RT,  1  hr,  Naq.  o 

27A2.38 

•  Yield  of  63%  based  on  nitration  reagent  Yield  of  31.5%  based  on  2-pynoUdinone. 


Pyrrolidinone  (1)  has  been  used  as  a  standard  for  amide  nitrations.  Literature 
reports  on  pyrrolidinone  nitrations  using  a  variety  of  reagents  range  from  30  to  56%.^ 
Using  trifluoracetic  anhydride  and  ammonium  nitrate  Suri  and  Chapman  report  a  yield  of 
30%  for  9. 

To  establish  a  standard  ammonium  nitrate,  NH4NO3,  and  tetrabutylammonium 
nitrate,  Bu4NNC)3,  were  allowed  to  react  with  trifluoromethanesulfonic  anhydride,  triflic 
anhydride  (Tf20),  in  pure  methylene  chloride  to  generate  nitronium  triflate,  equation  1.  As 
expected  the  insolubility  of  the  NH4NO3  in  methylene  chloride  gave  low  yields,  13%, 
Table  1.  However,  the  more  soluble  BU4NNO3  gave  a  signifcant  increase  to  43%.  Use  of 
Na2S04  as  a  weak  base  to  neutralize  any  acid  produced  when  NH4NO3  is  the  nitrate  source 
showed  no  significant  increase  in  yield,  16%.  When  Na2S04  was  used  in  the  reaction  with 
BU4NNO3  as  the  nitrate  source  the  yield  was  comparable  to  that  without  Na2S04, 38%. 
This  would  suggest  that  the  protons  derived  from  the  ammonium  cation  or  the  amide  do  not 
interfer  with  the  nitration  reaction. 
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•0'^  ^0- 


TfiO 


0 

li  jj 

•O'^  '^O — S — CFj 

II 

o 

+  Bu4N0Tr 


TfO'  +  NO2* 


(1) 


To  determine  whether  nitromethane  serves  as  a  catalyst  and/or  just  a  solubilizing 
agent,  each  reagent  was  examined  using  8  equivalents  of  nitromethane  The  use  of  8 
equivalents  was  experimentally  determined  as  this  was  the  minimum  amount  of 
nitromethane  required  to  stir  tlie  reaction  mixture  during  formation  of  the  nitronium  triflate. 
When  the  substrate  was  added,  methylene  chloride  was  used  to  help  solubilize  the  reaction 
mixture.  Using  NH4NO3,  Tf20,  and  Na2S04  the  yield  was  31%  which  is  comparable  to 
using  (CF3C0)20  and  NU4NO3  in  pure  nitromethane  solvent.  It  would  appear  that 
nitromethane  serves  a  solublizing  function.  This  also  compares  favorably  with  the  initial 
reaction  run  by  Major  Shackelford  at  FJSRL.  When  BU4NNO3  was  substituted  as  the 
nitrate  source,  using  8  equivalents  of  nitromethane  to  initially  generate  the  nitronium 
triflate,  the  yield  of  9  was  increased  to  45%.  This  can  represent  a  solubilizing  contribution 
by  nitromethane  on  the  nitronium  triflate  salt  generated  prior  to  nitration. 

It  has  been  suggested  by  Suri  that  the  substrate  may  participate  as  a  base  in  the 
reaction  mixture.^  If  one  equivalent  of  reagent  is  taken  up  by  the  acid  produced  after 
nitration  then  the  overall  yield  would  be  reduced.  Major  Shackelford  postulated  that  a 
twofold  addition  of  nitronium  ion  to  pyrrolidinone  may  occur,  equation  2.  Thus  a 
comparison  was  done  using  Bu4i’Jr403  and  doubling  the  initial  substrate  concentration. 
Conversely,  a  comparison  was  made  doubling  the  reagent  concentration.  When  the 
substrate  is  doubled,  and  NC)2Tf  is  the  limiting  reagent,  the  substrate  can  participate  more 
effectively  as  a  base.  On  comparison  of  entry  9  (56.5%),  and  10  (63%),  onlv  a  slight 
increase  in  yield  is  observed,  insufficient  for  participation  of  the  substate  as  a  base.  When 
the  reagent,  N02Tf,  is  doubled,  compare  entry  6  (38%),  and  7  (41%),  little  difference  is 
observed.  This  would  suggest  that  participation  of  a  second  N02Tf  in  the  reaction  does  not 
occur.  When  one  compares  the  different  nitrate  salts,  the  solubility  of  the  salts  when  the 
reagent  is  doubled  exhibits  more  of  an  effect  than  the  concentration  change. 
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AnieacPocs;ine«>ccritTeof!l8ermaibaB^ienipaaaflcs.waealfe>^^ 
lEiofff  to  foim  file  nfironiiimiriflaieieafem  prior  Eoaddinmi  of  the  sabscaie.  A 
cooiparisoa  of  inculadoo  times  and  fields  following  soiisirate  a^iioa  wQi^  dies  dicw  a 
fiinctioo  of  die  amoiint  of  nitroohmimflaie  framed  duiing  the  1  hour  period.  Asamai^ 
of  note,  die  nkronium  niflaie  ^ipeaied  to  be  a  white  insoluble  salt  in  an  leacdon  mixtines, 
as  evidenced  the  fonnadon  of  what  ^ipeaied  to  be  a  white  solid  and  a  tfaideening  of  the 
icacdonmixtaie:  When  identical  leacdooswTieaDowed  to  pcoceed  at  loom  ten^ieratuie 
CRT)forl  boor  and  18  hours  foUowing  substrate  addidrat^enny  11  (25%)  and  9  (57%) 
shewed  that  longer  incubadoo  dmes  would  ggni  ficamly  increase  the  leacdon  yield.  This 
nay  be  a  consequence  of  solid  nitromum  Inflate  being  consumed  and  ledissohred  as  die 
teacdon  is  aUowed  to  {sooeed. 

The  use  of  a  nrai-aqueous  woikup  in  the  nilradon  of  2-pyirolidinoce  did  not  ^'e 
tfaedesiicdN-nitn>'2-])ynolidinraie.  This  result  is  in  total  disagreement  with  die  repotted 
isoladon  by  Suii  and  Chajfinan,  but  agrees  with  the  observanrais  inidany  observed 
Majra  Shackelford  and  Dr.  Qay  Sharts  at  FJSRL.^  Following  conceniradon  and 
chromatography  a  pale  yeUow  impure  solid  was  obtained.  This  solid  was  insoluble  in 
CDQ3,  however  an  NMR  in  DMSO  showed  a  cranplex  mixture.  Nc  starting  material  or 
product  signals  were  observed.  Signals  at  S  11.97  and  5  6.08  may  indicate  an  acid  and/ra 
an  N-H  functionality  and  peaks  at  8  437, 2.55,  and  230  appear  close  to  values  for 
buyrolactone.^  How'ever,  an  opoi  chain  add  or  amine  cannot  be  eliminated,  see  equation 
3- 

Pyrrolidine  (2)  Investigations:  Formation  of  l-Nitro-pyrrolidine  (10) 

Table  2.  Synthesis  Reactions  Producing  NNitro-pyrrolidine  (10) 

C  o 

\  H 

N — X‘ 

^  V 

2  10 


m 

Solvent  or 
Generation  T  (’C) 

Salt  R  of 
Kel'NOi 

Ratio 

Rst/Subs 

Scirent  of 
Rxn 

'C,  time 

Yield 

Run 

1 

CHtClj  -20 

H 

1:1 

CHiCI, 

-20,  1  hr 

51.4 

3A1-5 

2 

CIljCIj  .20 

Bu 

1:1 

CH1CI2 

-20,  1  hr 

43 

9  A 1-27 

3 

CHjCI,  0 

Bu 

1:1 

CHjCI, 

RT,  I  hr 

38.8 

25A2-28 

4 

CH-Sh  0 

Bu 

1:1 

CH2CI1 

Rt,  17  hr 

33 

25B2-30 
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I^indidhse  (2)  is  a  secondaiy  2n^  wiiboot  ibe  caibcayl  assodaied  wdi  &e 
niiw^en  for  conju  satioa.  ^Ecsmnnoifkisdiah'edfiomBasNNOjandtnflicsnhjidnde 
wasusedtoe3q}k)!et!iecia^ostsofin'vkc*aicc^!e.Ta!^2.  \Vbea  NHjN03  and 
Ba$^C^«%icaScmiediO!eact,cofi^[»!eect!y  1  (Sl%)and2(43%Xthe!esiil!ssi!g§£st 
diatdseainnmaiiminiaaieinaybebeGerdianibefciiabu^'lannnoaiiimsaiL  Thisicsoltiny 
belowoKtadaingibeodKricac^i^exploied  When reaction KnqKiances are 
increased  ibe^e  is  IK)  ssniScantinctsase  in  Inncop^iiDlidinefonnanofL  When  die  reaction 
dme  is  increased  there  may  Ik  a  sli^t  lowenng  cf  the  reacdiMi  yield  but  diis  may  be  whhin 
ciror.  A  qaesdon  of  wfaedK?  die ':aibonyi  contributes  to  the  teaedvi^  of  die  nitnMuum 
tiiflaieascon^aredto2-]^iTclidiscnccanbese^.  The  carbonyl  increases  the  yield  of  die 
ninadonsu  An  anomaly  seems  ro  be  vntht;<iipenttnTe  of  the  idnoniumtiiflaKfrxinadon. 

I'Acetyi-pyrrolindine  (3):  Formation  of  l>Nitro>pyrrolidine  (iO) 

Table  3.  Sjatfaesis  Reactioas  ProdsciBg  l-Nitro-pTfrolidiBe  (10) 


3 

It 

EsUt 

SolTOIt  of 
Gcscratioa  T  ('O 

Sdtilor 

KcNN'O, 

Ratio 

RetiSobs 

Solreat  of 

Ru 

'C,  tinie 

Yield 

Rd« 

1 

CHtCI.  • 

Ba 

1:1 

CHiai 

RT.  Ihr 

47.» 

Z9A2-40 

2 

CHjCl.  • 

Bo 

1:1 

CHjCtj 

Rt,  »  br 

47.t 

Z9B2-4Z 

Most  nitrations  used  in  the  fomiation  of  HMX  or  RDX  occur  by  displacement  of  an 
acetylated  nitrogen,  an  amide.  In  l-acetyl-pyirolidine  (3)  the  amide  and  carbonyl  are 
substituted  in  a  way  as  to  be  eliminated  upon  conversion  to  nitrated  product.  Table  3. 
Hence,  if  the  amide  bond  in  2-pytrolidinone  is  broken  an  ucid  may  be  formed,  equation  3 
The  product  may  subsequently  be  eliminated  in  the  workup  or  by  chromatography. 
(Comparison  of  pyrrolidine  with  N-acetylpyrrolidine  suggests  that  the  carbonyl,  in  the  form 
of  an  acetyl,  increases  the  yield  of  the  nitration.  This  may  be  a  consequence  of  the 
intermediates  and  subsequent  breakdown  products  of  the  reaction  or  geometric  restrictions 
enforced  by  cyclized  starting  material.  When  such  products  are  intramolecularly  bound 
where  R  is  attached  to  R'  in  equation  3,  then  the  nitration  products  could  be  removed  in  the 
workup. 
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I'Acetjl'piperidine  (4):  FonnaticHi  of  i-Nitro>piperidine  (11) 

Table  4.  SjBthests  Reac^as  Prodaciag  l-Ntlro-piperidine  (11). 


XOiOTT 


4  11 


Eatrr 

Soivnit  of 
Generation 

Sdt  Ror 
TCC.i  E<N.XO, 

Ratio  Solvent 
Ret/Snbs  of  Rxn 

'C,  time 

Yield 

Ron 

1 

CHjCI. 

e 

Bd 

1:1 

CHjCI, 

RT,  1  hr 

69 

23A2-16 

2 

CHiCU 

0 

Bo 

1:1 

CH,Clj 

RT,  16  hr 

72.3 

22B2-22 

3 

CH,CU 

0 

Bo 

1:1 

CHiCU 

RT,  1  br,  Xaq 

53.6 

33A3'10 

The  fonmtion  of  1-nitro-piperidine  (11)  from  1 -acetyl-piperidine  (4)  parallels  that 
of  l-nitro-pyiTolidine,  Table  4.  The  major  difference  is  the  size  of  the  ring  in  the  acetylated 
starting  material.  The  nitrations  run  from  1  hour  and  16  hours  at  room  temperature  using 
an  aqueous  workup  gave  yields  of  69%  and  72%,  respectively.  This  would  suggest  that 
the  acetyl  displacement  is  nearly  complete  after  1  hour.  If  the  isolation  is  performed  using 
non-aqueous  conditions  the  yield  is  slightly  reduced,  64%.  In  general,  the  six  membered 
ring  starting  material  gives  a  50%  increase  in  isolated  yield  when  compared  to  1-acetyl- 
pyrrolidone. 
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l>Acetjl«4>piperid9De  (S):  2-flllsrii-4-pipsd<Sone  (12) 

Table  5.  Svatbesis  Sencl»«ft2i  ’Pr^idBcisig  l-?Stro~4~piptTidOBe  (12). 


SoItcsI  of  Sdl  R  of  Ratio  SoS»b! 

Eattr  Geaeratioa  T(*C}  R^XXO^  Rgt^bs  cf  K.M  'C,  time  Yitid  Rna 


1 

CHjCW 

• 

Ba 

ca  -.'i,  CT,  t  Sr 

22 

23A2-1S 

2 

CHzCb 

• 

Bn 

1:1 

CiiiCIi  2T,  !6  kr 

13.5 

23B2-24 

3 

CBjCk 

0 

Bs 

1:1 

CBjCl*  S'*',  1  fcr.  Nan 

31.5 

32A3-S 

The  structure  of  l-2ceQ'l-4-pipeiid(M!e  (5)  is  unique  in  com|arison  to  the  otha- 
heterocycles  used  in  this  study.  Table  5.  In  all  other  cases,  a  carbonyl  is  adjacent  ui  the 
nitrogen  functitmality,  Le.  amide  functionalities,  while  in  i-acetyl-4-pipeiidone  (5)  -s 
ketone  carbonyl  is  isolated  firom  the  nitrogen  in  addition  to  the  amide.  Thus  5  cot: Id  ^  shibit 
through-space  electronic  contributions  to  the  substitution  reactions.  Using  this  a  1- 
acetyl-4-piperidone  (5)  would  be  expected  to  have  yields  comparable  to  1-acetyl  prperidine. 
This  is  not  the  case.  The  yields  of  22%  (entry  1)  and  18%  (entry  2)  are  ccnsid  jrab:  y  lower 
using  an  aqueous  workup  and  is  31%  using  a  non-aqueous  workup.  SpecuInUon  on  the 
mechanism  may  suggest  that  die  ring  carbonyl  in  5  participates  in  some  fas’'  tion  to  the 
stabUiQr  of  the  reaction  intermediates  and  may  play  a  part  in  intermediate  dj  -cornpcsinon 
during  an  aqueous  workup. 

1-ImidazoIidone  (6):  Formation  of  l,3-dinitro>2-oxotetraiiydroltii:idazo!<e 
(13) 

The  cyclic  urea,  1-imidazolidone  (6),  has  a  stracture  which  ca;:  allo  w  jeson;sn.'e  of 
a  carbonyl  with  two  nitrogen  functionalities.  Table  6.  With  two  posiidk  it&cdc-i .  itc'.,  rn  o 
products  are  possible,  l,3-dinitro-2-oxotetrahydroimidazole  (13)  a  I-nitro-2- 
oxotetrahydro-imidazole  (14).  In  the  reactions  performed,  the  cb  ,i.rved  product  was  I V 
Using  an  aqueous  workup  yields  of  5.6%  for  13  was  observed  f«jr  a  reaction  run  entire  .y 
at  -20”  C.  However,  when  the  temperature  was  raised  to  0“  C  i  >;■  18  hours,  the  yield 
jumped  to  19%.  Substitution  of  nitromethane  as  the  reaction  so  vent  gave  a  comparable 
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low  yield  of  S%.  This  would  suggest  that  teagent  solubihty  with  nitnuneihane  was  not  a 
factor  in  tte  observed  yield.  When  the  leacdon  was  run  at  room  ten^eratme  fori  hour  and 
a  non-aqueous  w^oip  peifonned,  the  observed  yield  of  32.6%  was  considerably  higher 
t^-nn  in  earlier  experiments.  Since  no  mono-nltro  product  14  is  observed,  14  is  more 
neactive  than  the  carting  maieriaL  A  plausible  explanation  for  the  incicased  leacdvity  of  14 
towards  nitraticm  is  the  fonnation  of  a  product  whoe  the  lone  pair  of  the  nitrogen  in  the 
nitro-afiuno  is  in  resonance  with  the  carbonyl  and  the  nitro  group.  This  structure  would 
isolate  the  resorrance  contribudons  of  the  amide  linkages  and  give  the  unsubsdtuted  amide 
moie  andsss  character,  Le.  fiee  the  electrtm  pair  on  nitrogen  for  substitution. 

Table  6.  Synthesis  Reactions  Producing  l^-dinitro-lHrxotetrahydroiniidazoIe  (13). 


c  o  o 


6 

13 

14 

Entry 

Soivent  of 
GeneratioB 

Salt  R  of 
T(*C)  R4NSO, 

Ratio  Solvent 
R^Snbs  ot  Rx3 

*C,  time 

Yield 

Rna 

1 

CHjCli 

•20 

Bn 

2:1 

CHjCIt 

-20,  1  hr 

5.6 

13A1-36 

2 

CHjClj 

0 

Bu 

2:1 

CHjCl, 

0,  24  hr 

19 

14A1.3S 

3 

CHjNO, 

0 

Bn 

2:1 

CHiCl2 

0,  24  hr 

5 

15A1-40 

4 

CHjCl, 

0 

Bu 

2:1 

CHjCIi 

RT,  1  hr,  Naq 

32.6 

31A3.6 

Succinimide  (7):  Formation  of  N-Nitrosuccinimide  (15) 

In  succinimide  (7)  the  electron  pair  in  the  nitrogen  is  doubly  stabilized  by  the 
flanking  carbonyl  in  the  amide  linkages.  Table  7.  The  acidity  of  the  protons  on  the  atoms 
adjacent  to  two  carbonyl  is  well  known.  Thus  the  electrons  on  nitrogen  in  succinimide 
would  not  be  expected  to  be  available  as  in  other  amides  for  nitration.  When  nitration  was 
attempted  using  an  aqueous  workup,  no  N-nitro-succinimide  was  isolated.  On  careful 
inspection  during  the  reaction  it  appeared  that  any  product  formed  was  reacting  further 
during  isolation.  When  the  workup  was  changed  to  non-aqueous  conditions,  the  yield 
increased  dramatically  to  69%  and  76%.  Since  the  only  change  in  reaction  conditions  was 
a  5%  NaHCOs  quench  and  extraction,  the  result  suggest  that  water  destroys  the  product. 
In  a  non-aqueous  workup  reported  by  Suri  the  yield  was  28%  and  clearly  our  method  is 
significantly  better. 

To  rationalize  these  observations  one  must  consider  the  possible  hydrolysis  of  the 
N-nitrosuccinimide.  As  in  N-bromosuccinimide,  where  exposure  to  HBr  produces  Br2 
and  succinimide,  water  may  be  significantly  acidic  as  to  react  with  the  product  to  produce 
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siKxiiiiimde  and  nitric  acid,  equation  4,  This  sensitivi^  and  decon[q)osiricn  in  water  is 
eliminated  by  the  non-aqueous  workup. 

Table  7.  Syetbests  Reactioas  Prodnciog  N-Nitrosaccininiide  tlS). 


15 


Hydantoin  (8):  Formation  of  l>Nitrohydantoin  (16) 


The  structure  of  hydantoin  incorporates  the  features  of  succinimide  and 
pyrrolidinone.  Table  8.  Hydanotin  has  two  nitration  sites  and  there  was  a  big  difference  in 
workup  procedures.  While  use  of  nitromethane  or  tnejhylene  chloride  did  not  give 
substantial  amounts  of  1-nitrohydantoin  in  the  aqueous  wo."' :: ,  a  trace  amount  of  product 
(<5%)  was  obseived  when  nitromethane  was  used.  Realizing  u  sensitivity  to  workup 
and  the  observed  increased  yield  with  succinimide,  hydantoin  was  thought  to  have  die  same 
characteristics.  The  reaction  of  hydantoin,  with  a  non-aqueous  workup,  gave  69%  yield  of 
16  when  no  product  had  been  previously  isolated.  Interestingly,  the  initial  solid  isolated 
following  vhromoatography  haa  t  pale  yellow  color.  Storage  overnight  in  a  sealed  flask 
resulted  in  the  formation  of  a  brown-orange  gas.  This  would  indicate  an  outgassing  of  the 
sample  to  produce  NO2.  Characterization  of  the  initial  isolate  was  not  done,  however  the 
changes  would  suggest  that  the  colored  gas  is  generated  from  an  unstable  dinitro 
compound.  The  actual  structure  and  mechanism  of  this  decomposition  should  be  explored 
further.  In  light  of  the  structures  of  N-nitrosuccinimide  and  1-nitro-pyrrolidinone,  the 
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isolati(Mi  of  mono-nitrated  compound  with  the  nitro  group  on  the  least  reactive  amide  is 
CKpecxeA.  However  the  non-aqueous  woricup  may  limit  the  yield  considering  the  »on- 
aqueous  woikup  of  pynolidinone. 

Table  8.  Sjmthesis  Reactions  Prodocing  1-Nitrohydantoin  (16). 

SO^OTf 


s  16 


Entry 

Solvent  of 
Generation 

TCC) 

Salt  R  of 
R4NNO3 

Ratio 

Rgt/Snb 

Solvent 
of  Rxn 

*C,  time 

Yield 

Run 

1 

CftCIi 

•20 

Bu 

1:1 

CH2CI2 

•20,  1  hr 

0 

12A1-34 

2 

CUCIj 

0 

Ba 

1:1 

CI^CIj 

RT,  17  hr 

12B1-44 

3 

CH,NO|t 

0 

Bu 

1:1 

CHjNOj 

RT,  17  hr 

18A2-S 

4 

CHjNOi 

0 

H 

1:1 

CHjNOj 

RT,  17  hr 

<5 

19A2-10 

5 

CHjClj 

0 

Bu 

1:1 

CHjClj 

RT,  1  hr,  Naq 

69.2 

30A3.4 

Explorations  of  New  Nitration  Possibilities. 

Table  9.  Synthesis  Attempts  Using  New  Deoxygenation  Techniques. 


Not  Obtained 


Entry 

Solvent  of 
Generation 

Salt  R  of 
TCC)  R^NNO, 

Ratio 

Rgt/Subs 

Solvent 
of  Rxn 

*C,  time 

Yield 

Run 

1 

CHjCl, 

•  78 

H/SOCIj 

1:1 

CHjCIj 

•78,  1  hr 

4 

SAl-14 

2 

CHjCl, 

•60 

H/SOCli 

1:1 

CH2CI2 

•60,  •lO,  1  hr 

5B1-18 

3 

CHjCIj 

•  60 

DMSO/(COCI)i  1:1 

CHjCIj 

•60.RT,  1  hr 

0 

17A1-46 

(no  prod) 


An  examination  of  the  mechanism  of  nitronium  triflate  formation  gives  an  ovei-all 
deoxygenation  of  nitrate  anion,  cf  equation  1.  Similiar  reactions  have  been  used  to  activate 
carbon  dioxide.  From  this  observation  it  was  postulated  that  other  deoxygenation 
conditions,  especially  non-protic  conditions,  may  allow  deoxygenation  of  nitrate  anion. 
Table  9.  In  the  Swem  oxidation,  an  active  acid  chloride  deoxygenates  DMSO  to  form  a 
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^  new  Lewis  acid  which  then  reacts  with  alcohols,  equation  5.  Two  mechanisms  below  were 
*  postulated,  equations  6  and  7  In  each  case  m  active  N02'*'  or  NC)2C1  could  be  an 

electrophile  for  amide  functionalization.  Noig^  that  the  structure  in  each  has  a  stn jcture 
similiar  to  that  of  an  anhydride  like  nitronium  triflate.  When  the  reaction  was  performed 
using  SOQ2  as  the  acid  chloride,  a  new  product  chemically  different  from  l-nitro- 
pyrrolidinone  and  starting  material  was  observed.  By  mass  spectrometry  and  ^-JMR  the 
product  was  tentatively  assigned  as  the  nin-oso-amide  17,  equation  8. 
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{COClJi 


CH,/  CHiCI,.  .60-C 
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Conclusions 

New,  non-protic  N-nitration  methods  were  explored  using  nitronium  triflate, 
NO2OSO2CF3  (N020Tf),  generated  in  situ  from  tetrabutylammonium  nitrate,  Bi^NNOs, 
and  trifluoromethanesulfonic  anhydride,  (CF3S02)20  (Tf20).  2-PyiTolidinone, 
pyrrolidine,  N-acetyl-pyrrolidine,  N-acetyl-piperidine,  N-acetyl-4-piperidinone, 
imidazolidinone,  succinimide,  and  hydantoin  were  nitrated  using  these  methods.  Aqueous 
and  non-aqueous  workups  gave  yields  ranging  from  20-76%.  Aqueous  workup  on  2- 
pyrrolidinone,  pyrrolidine,  N-acelyl-pyrrolidine,  N-acetyl-piperidine,  and  N'acetyl-4- 
piperidinone  containing  one  carbonyl  involved  in  one  amide  likange  gave  the  best  results. 
Non-aqueous  workup  on  imidazolidinone,  succinimide,  and  hydantoin  gave  the  highest 
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yiel^.  Investigations  involving  nitromethane  as  a  solvent  showed  it  did  not  inciease  the 
yield  of  products,  and  served  only  in  a  solubilizing  capacity  for  reagents.  Reactions  using 
weak  base,  or  excess  starting  material  to  allow  it  to  participate  as  a  base,  did  not  improve 
the  overall  yields.  The  use  of  a  quatemaiy  ammonium  salt  nitrate  source  gready  improved 
the  yields,  and  allowed  all  reactions  to  be  run  in  pure  dichloroemdiane.  The  reaction 
methods  are  significandy  improved  and  eliminate  the  hazard  potential,  catalytic  and 
solubilizing  necessiQr  and  reaction  woikup  requirements  of  nitromethane  as  a  reaction 
solvent  previously  reported. 


Experimental  Section 

General  Comments:  All  reactions  were  carried  out  under  an  atmosphere  of  dry 
nitrogen.  Air  and/or  moisture  sensitive  reagents  were  handled  by  in  a  inert  atmopshere  dry 
box  and  flasks  were  kept  under  a  positive  nitrogen  atmosphere.  Methylene  chloride  was 
freshly  distilled  from  the  calcium  hydride  just  prior  to  use.  Tetrabutylammonium  nitrate 
and  trifluoromethanesulfonic  anhydride  was  obtained  from  Aldrich  and  handled  in  a  dry 
box.  Acetic  anhydride  was  obtained  from  MallinckrodL  All  other  solvents  and  reagents 
were  obtained  from  Aldrich,  or  reagent  grade,  and  used  without  further  purification. 
Reactions  were  monitored  by  thin  layer  chromatography  on  silica  gel  plates  (E.  Merck 
Kieselgel  60  F254)  and  column  or  filtration  chromatography  used  silica  gel  (J.  T.  Baker, 
60-200  mesh). 

High-field  NMR  spectra  were  recorded  on  Briiker  300  spectrometer.  Carbon-13 
and  proton  NMR  spectra  were  recorded  at  75.469  mHz  and  300.135  mHz  respectively. 
Chemical  shifts  are  reported  in  5  units,  part  per  million  downfield,  using  TMS  as  the 
reference  signal.  Carbon- 13  shifts  are  reported  in  6  units  using  CDCI3  or  DMSO  as  the 
solvent  and/or  reference  signal.  Infrared  spectra  were  obtained  using  a  Bio-Rad  FTS-7 IR 
spectrometer  with  a  SPC  3200  Data  Station  and  ran  neat  or  as  a  Kubelka-Munk  solid 
sample  with  KBr.  El  mass  spectra  were  recorded  on  a  HP  Model  5985  mass  spectrometer, 
with  an  HP  1000  Data  Station,  operating  at  70  eV.  Melting  points  were  taken  using  a 
Fisher  Melt-Temp  apparatus  and  are  uncoirected. 

General  Procedure  for  Nitrations  Using  Bu4NN03/TrifIic  anhydride,: 

Synthe  es  of  l*Niiropyrrolidinone  (9),  1-NitropyrroIidine  (10),  1-Nitro- 
piperic'.ine  (11),  and  l>Nitro-4«piperdinone  (12). 

In  a  100  mL  three  necked  round  bottom  flask  was  placed  5.0  mmol  (1.5224  g)  of 
BU4NNO3  and  to  this  added  30  mL  anhydrous  CH2CI2.  The  system  was  placed  under  a 
positive  nitrogen  atmosphere  and  cooled  to  O'C  using  a  salt/ice  bath.  A  10  or  25  mL 
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pressure  equalizing  dropping  fimnel  was  charged  with  5.0  nunol  (1.4107  g)  of  triflic 
anhydride  and  attached  to  the  reaction  flask.  The  triflic  anhydride  was  added  dropwise  to 
the  reaction  mixture  and  the  resulting  solution  immediately  turned  yellow  and  slowly 
formed  a  pasty  consistancy  with  formation  of  a  white  solid.  The  mixture  was  stirred  one 
hour  at  O’C.  In  a  dropping  funnel  was  placed  5.0  mmol  of  substrate  in  approximately  3 
mL  of  CH2CI2  and  the  solution  added  to  the  nitronium  trifiate  reaction  mixutre.  The 
reaction  mixture  was  kq)t  at  the  indicated  temperature  or  warmed  to  room  temperature  (RT) 
and  incubated  for  the  indicated  time.  After  the  reaction  was  complete,  25  mL  of  5% 
NaHCOs  was  added  to  quench  the  mixture  and  the  solution  stirred  for  15-30  minutes.  The 
reaction  mixture  was  tranfcrred  to  a  separatory  funnel  and  the  CH2CI2  layer  removed.  The 
aqueous  layer  was  extracted  1  x  25  mL  CH2CI2  and  the  CH2CI2  layers  combined.  The 
organic  layers  were  extracted  1  x  25  mL  5%  NaHC03, 1  x  25  mL  saturated  NaCl  solution, 
dried  with  MgS04,  filtered  and  reduced  in  volume,  A  orange  solid  or  oil  was  produced. 
The  reaction  mixture  was  chromatographed  on  a  silica  gel  column ,  1"  x  5",  using  400  mL 
of  1.3  ethyl  acetate-hexane  solvent  and  all  fractions  collected,  combined  and  concentrated  to 
give  the  product.  Compounds  9, 10, 13, 15,  and  16  gave  proton  and  carbon  spectra 
which  agreed  with  reported  literature  values.^  New  and  additional  information  for  the 
compounds  is  given  below, 

l-Nitropyrrolidinone  (9):  IR  (cm*l):  2980,  1774,  1766,  1559,  1554,  1276,  1229, 
1143,  1021,  935,  839,  806, 760,  688, 629. 

l-Nitropyrrolidine  (10):  IR  (cm-i):  2961,  1731,  1629,  1494,  1371,  1308,  1261, 
1026,  970,  910,  858,  763,  709. 

l-Nitropiperidine  (11):  iH-NMR  (5,  CDCI3):  3.86  (t,  4H),  1.72  (m,  4H),  1.60  (m, 
2H);  13C.NMR  (5,  CDCI3):  49.10, 23.98, 22.57;  IR  (cm-1):  2940,  2857, 1642, 1515, 
1442, 1387, 1326, 1276, 1240, 1063,  1012, 965,  854, 762;  MW  calcd.  for  C5H10N2O2 
130.15  g/mol;  Exact  mass  calcd.  for  C5H10N2O2  130.742206;  Analysis  calcd.  C,  46.15; 
H,  7.75;  N,  21.52;  0,  24.59;  Found. 

l.Nitro-4-piperidinone  (12):  MP  74-76'  C;  ^H-NMR  (5,  CDCI3):  4.28  (t,  2H), 
2.64  (t,  2H);  13c-NMR  (5,  CDCI3):  204.30, 46.42,  38.37;  IR  (cm-1):  3030,  2962, 

2918,  1719,  1460,  1439,  1368,  1313,  1308,  1275,  1246,  1150,  1108,  1016,  974,  949, 
828, 765, 744;  MW  calcd.  for  C5H8N2O3  144.13  g/mol;  Exact  mass  calcd.  for 
C5H8N2O3  144.0534855;  Analysis  calcd.  C,  41.67;  H,  5.60;  N,  19.43;  O,  33.30;  Found. 

Synthesis  of  l,3-Dinitro-2-oxotetrahydroimidazole  (13). 
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In  a  100  mL  three  necked  round  bottom  flask  was  placed  5.0  mmol  of  BU4NNO3 
and  to  this  added  30  mL  anhydrous  CH2CI2.  The  system  was  placed  under  a  positive 
nitrogen  atmosphere  and  cooled  to  O^C  using  a  salt/ice  bath.  A  10  or  25  mL  pressure 
equaliang  dropping  funnel  was  charged  with  5.0  mmol  of  triflic  anhydride  and  attached  to 
the  reaction  flask.  The  triflic  anhydride  was  added  dropwise  to  the  reaction  mixture  and  the 
resulting  solution  immediately  turned  yellow  and  slowly  formed  a  pasty  consistency  with 
formation  of  a  white  solid.  The  mixture  was  stirred  one  hour  at  O'C.  As  a  solid,  2.5  mmol 
of  imidazolidinone  was  added  to  the  nitronium  triflate  reaction  mixture.  The  reaction 
mixture  was  warmed  to  room  temperature  (RT)  and  incubated  for  the  indicated  time.  After 
the  reaction  was  complete  25  mL  of  5%  NaHCOs  was  added  to  quench  the  mixture  and  the 
solution  stirred  for  15-30  minutes.  The  reaction  mixture  was  tranferred  to  a  separatory 
funnel  and  the  CH2CI2  layer  removed.  The  aqueous  layer  was  extracted  1  x  25  mL 
CH2CI2  and  the  CH2CI2  layers  combined.  The  organic  layers  were  extracted  1  x  25  mL 
5%  NaHCOs,  1  x  25  mL  saturated  NaCl  solution,  dried  with  MgS04,  filtered  and  reduced 
in  volume.  An  orange  oil  was  produced.  The  reaction  mixture  was  chromatographed  on  a 
silica  gel  column ,  1"  x  5",  using  400  mL  of  1:3  ethyl  acetate-hexane  solvent  and  all 
fractions  collected,  combined  and  concentrated  to  give  the  product 
(13):  MP  209-21  r  C;  iH-NMR  (8,  DMSO):  4.12;  IR  (cm-J):  2960,  2924, 2855, 2515, 
2337,  2237,  1790,  1581,  1562,  1549,  1485,  1477,  1306,  1260,  1101,  1006,  948,  798, 
754,  738,715,  705. 

Formation  of  13  utilizing  the  general  non-aqueous  workup  described  below  gave  a 
yield  of  32.6%. 

Synthesis  of  N-Nitrosuccinimide  (15);  General  Non-aqueous  workup. 

In  a  100  mL  three  necked  round  bottom  flask  was  placed  5.0  mmol  of  BU4NNO3 
and  to  this  added  30  mL  anhydrous  CH2CI2.  The  system  was  placed  under  a  positive 
nitrogen  atmosphere  and  cooled  to  O’C  using  a  salt/ice  bath.  A  10  or  25  mL  pressure 
equalizing  dropping  funnel  was  charged  with  5.0  mmol  of  triflic  anhydride  and  attached  to 
the  reaction  flask.  The  trifiic  anhydride  was  added  dropwise  to  the  reaction  mixture  and  the 
resulting  solution  immediately  turned  yellow  and  slowly  formed  a  pasty  consistancy  with 
formation  of  a  white  solid.  The  mixture  was  stirred  one  hour  at  0°C.  As  a  solid  5.0  mmol 
of  finely  powdered  succinimide  was  added  to  the  nitronium  triflate  reaction  mixture.  The 
reaction  mixture  was  warmed  to  room  temperature  (RT)  and  incubated  for  the  indicated 
time.  After  the  reaction  was  complete  the  system  was  placed  on  a  rotoevaporator  and 
concentrated  at  room  temperature  under  reduced  pressure.  The  solid  thus  obtained  was 
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dissolved  with  1:1  ethyl  acetate/hexane  and  layered  onto  a  1"  x  5"  silica  gel  column  and 
eluted  with  1:3  ethyl  acetate/hexane.  All  fractions  were  collected  and  concentrated  under 
reduced  pressure  to  give  a  flaky  aeam  colored  solid.  The  melting  point  obtained  was  82- 
84  °C,  lower  than  the  literature  reported  89  “C.  The  proton  and  carbon  NMR  spectra 
agreed  with  the  reported  literature  values.^ 

(15) ;  MP  82-84“  C;  iR-NMR  (6,  DMSO):  2.83;  ISc-NMR  (5,  DMSO):  167.44, 26.86; 
IR  (cm-1):  3552,  3005, 2952, 1742, 1590,  1423,  1407,  1302,  1266, 1163, 1048, 1003, 
807,  732,  635. 

Synthesis  of  l>Nitrohydantoin  (16). 

In  a  100  mL  three  necked  round  bottom  flask  was  placed  5.0  mmol  of  BU4NNO3 
and  to  this  added  30  mL  anhydrous  CH2CI2.  The  system  was  placed  under  a  positive 
nitrogen  atmosphere  and  cooled  to  0°C  using  a  sall/ice  bath.  A  10  or  25  mL  pressure 
equalizing  dropping  funnel  was  charged  with  5.0  mmol  of  triflic  anhydride  and  attached  to 
the  reaction  flask.  The  triflic  anhydride  was  added  dropwise  to  the  reaction  mixture  and  the 
resulting  solution  immediately  turned  yellow  and  slowly  formed  a  pasty  consistancy  with 
formation  of  a  white  solid.  The  mixture  was  stirred  one  hour  at  0°C.  As  a  solid  2.5  mmol 
of  finely  powdered  hydantoin  was  added  to  the  nitronium  trifiate  reaction  mixture.  The 
reaction  mbcture  was  warmed  to  room  temperature  (RT)  and  incubated  for  the  indicated 
time.  After  the  reaction  was  complete  the  system  was  placed  on  a  rotoevaporator  and 
concentrated  at  room  temperature  under  reduced  pressure.  The  solid  thus  obtained  was 
dissolved  with  1:1  ethyl  acetate/hexane  and  layered  onto  a  1"  x  5"  silica  gel  column  and 
eluted  with  1:3  ethyl  acetate/hexane.  All  fractions  were  collected  and  concentrated  under 
reduced  pressure  to  give  a  pale  yellow  solid.  On  allowing  the  solid  to  stand  in  a  closed 
round  bottom  flask  overnight,  a  brown  orange  gas  developed  which  could  be  easily 
removed  by  flushing  with  nitrogen.  The  remaining  white  solid  gave  pure  16. 

(16) :  MP  171-173“  C;  iH-NMR  (5,  DMSO):  11.99  (s,  IH),  4.63  (s,  2H);  l^C-NMR 
(5,  DMSO):  166.02,  149.52,  51.92. 

Synthesis  of  N-Acetyl-pyrroIidine  (methyl  pyrrolidinyl  ketone)  (3), 

The  synthesis  of  N-acetyl-pyrrolidine  was  a  modified  procedure  of  Woodward. 
Additional  procedures  consulted  were  those  used  to  synthesize  HMX^  and  RDX,^  however 
the  N-acetyl-pyrrolidine  was  expected  to  be  a  liquid,  limiting  the  isolation  techniques 
utilized  for  HMX  and  RDX.  To  a  500  mL  round  bottom  flask  containing  20.40  g  (0.198 
mol)  of  acetic  anhydride  in  100  mL  CH2CI2,  cooled  to  0“  C,  was  added  dropwise  a 
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solutkmof  14.22g(0J99iTO9ofp>Tro5<Saeia50mLCH2<!i2Owgl5miiiBaoi.  Aa 
ex<^iernk:ieactionoccuned.  Thes$ssmwass^iiedl6ho«a5a£ioo!ceE!i|Ksafaie:  Tbe 
s^-siem  was  {daced  in  a  large  beaker,  ni^acQc  scniog.  asid  10  ii  sllo«%  at^ed  100  ni, 

5%NaH(X)3.  Tbe sclimoo evolved CO^snd was foiibsr({aend3edwii!BS(didNsHCX>| 
untflnoevobmonofCChoccuned.  Tbes>sigmwasp1acediaaseiiaia!flayfigmeland200 
niLH20  added,  the  system  shaken,  and  tbe  CH^Cb'cniofvcd.  Tbeaqoeonsl^'erwas 
extr<icted2xl59niL5%^^CO3.2x500mLlNHCl,  1  x5(H)mLs3!iiiaied?'^0 
-olution,  dried  with  MsS04,  IDtered  aod  concentrated  under  ledoced  piessiEC  to  pvea 
coloriessoil,  1^99g(lU  mmol),  5.8%  yield.  Note:  This  procedme  gave  a  sufiknem 
quantity  of  product  for  imtial  runs  however  it  does  not  lepiesent  a  fearible  route  for 
syntliesis. 

(3):  ^  T  NMR  (5,  CIXTy):  3.44  (m.,  4H),  2.04  (s,  2H),  1.84-2.00  (m,  4H);  l^C-CMR 
(5,  CD  :T3):  168.62, 46.91, 45.01, 25.63, 24.11;  IR  (cnr*):  3452, 2970, 2872, 1626, 
1452, 1427, 1343, 1225, 1194, 1020.986. 

Non-aqueous  wroilcup:  Nitration  of  2-pyrrolidmone  (9). 

An  identical  reaction  was  rq)eaied  as  for  2  pyrrolidiniMie  untO  the  initiation  of  tiie 
workup.  Following  incubation  for  the  1  hour,  the  ^'stem  was  placed  on  a  rotoevaporator 
and  concentrated  at  room  ten^>eratuie  under  reduced  pressure  to  give  an  orange  oiL  The  oil 
thus  obtained  was  dissolved  with  1:1  ethyl  acetai^hexane  and  layoed  onto  a  F  x  5”  sihca 
gel  column  and  eluted,  with  1:3  ethyl  acetat^hexane,  a  pale  yellow  band  which  on 
concentration  under  reduced  pressure  gave  a  yellow  oiL  This  oil  slowly  solidified.  The 
solid  was  insoluble  in  CDQ3  but  soluble  in  DMSO-dg.  The  spectra  was  too  complicated 
foi  assignment  and  the  major  signals  observed  are  listed  below. 

IH-NMR  (5,  DMSO):  1 1.96  (br),  6.08  (v  br),  5.1 1  (m),  4.27  (t),  4.04  (m),  3.44  (m), 

2.43  (m),  2.29  (m),  2.16  (m),  1.76  (m).  1.22  (m),  1.11  (m);  13C-NMR  (6.  DMSO): 
174.04,  117.88,  73.21,  67.24,  63.60,  60.14,  44.55,  30.86,  27.56,  21.96. 
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<jbB  Uiergal  decoE^xssitiGn  of  2,4,6-trijiit3X3tolus)e  CBH)  , 
3-iiitzD-l,2,4-1xiazol-'5-one  (tno) ,  their  deuterated  analogs,  and 
idxtures  thereof  have  been  studied  using  isofhersal  differential 
scanning  calorinetry.  Primary  isotope  effec±s  previoudy  noted  for 
both  ‘2SF  and  IflD  decxcpositicxts  were  diluted  or  altered,  probably  die 
to  the  different  sarple  pan  configuration  used  here,  inr  deocnposition 
appears  to  be  catalyzed  by  the  presence  of  Viton  rubber,  \diile  the  KDO 
decccposition  appears  to  be  catalyzed  by  confinesaent  of  its  prtxluct 
gasses.  Mixture  studies  indicate  that  a  2:1  mole  ratio  of  HIT  to  WTO 
is  sufficioit  to  cause  the  accelerated  deccnpositicxi  of  lur  and  consuae 
all  of  the  NTO  present  in  the  process. 

Introduction 

Althcuc^  NTO  was  first  synthesized  in  1905^,  interest  in  its  use  as 
an  ej^losive  has  just  recently  been  shcwn^.  fTID  does  not  appear  to 
suffer  the  sensitivity  prctolems  associated  with  its  more  famous 
counterparts  such  as  INT,  RDX,  and  HMX.  A  less  sensitive  e}5)losive 
currently  in  use,  TMB,  does  not  have  the  energetic  performance  of  RDX 
or  HMX.  Ihus  a  need  exists  for  the  develcprent  of  an  explosive  that  is 
more  safely  handled  than  inr,  HMX,  and  RDX  but  that  will  also  deliver 
the  explosive  characteristics.  NID  could  potentially  fill  such  a  role. 
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Mixtages  cf  WT  aad.  HR)  are  conaitly  under  investigation  as  pofcoitial 
eagflosives.  Ibou^  tiae  deccE^ositicn  of  ISIT  has  beoi  extensively 
studied,  very  little  week  has  been  directed  touaid  stuping  the 
decoEgosition  cf  USD  and  its  mxtures  witii  Of  particular 

interest  in  fht^  stu^  of  nixtaires  is  an  understanding  of  the  wprfianigM 
of  the  BHtual  dcocppositions  and  the  effects  of  varying  cc^xasitiens  of 
lUr  and  KDO. 


lha  prirary  tool  to  be  used  to  understand  the  mechanifan  of  the 
deocepositiion  is  the  Idnetic  dguberiua  isotope  e^ect  (KDIE)^. 

Briefly,  substitution  cf  a  dRirt:eri\ini  for  a  hydrogen  in  a  ocepeund  inay 
affect  the  rate  oenstants  observed  fro  the  reactions  of  that  ocopound. 
Isotope  effects  are  breten  into  three  categories  ~  prinary,  seocxidary, 
and  inverse.  A  prinary  isotope  effect  is  observed  if  the  substituted 
bend  is  brolosn  in  the  rate  limiting  step  of  the  reacti«i.  A  primary 
isotope  effect  leads  to  a  rate  constant  ratio  of  hydrogenated  to 
deuterated  cenpound  that  exceeds  1.35.  A  secondary  isotope 

effect  is  observed  if  a  bend  other  than  the  substituted  bond  is  broken 
in  the  rate  limiting  stqp  of  the  reaction.  A  seccxidary  isotope  effect 
leads  to  a  ratio  in  the  range  of  1.00-1.34,  d^aending  i:5)on 

the  distance  of  the  bond  broken  in  the  transition  state  fran  the 
substitution.  The  third  isotope  effect  is  referred  to  as  an  inverse 
isotope  effect.  In  this  case,  the  ratio  is  less  than  one. 

'The  KDIE  has  been  used  successfully  in  the  past  to  stuefy  the  thermal 
decomposition  of  INT,  HHX,  RDX,  and  TftTB^.  In  this  work,  the  thermal 
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deccE^iositicxis  of  lOO  and  KKMS  ai-e  studied  in  an  effort  to  better 
understand  their  nechanisns.  Additionally,  zdxtures  of  ISIT  and  KEO  and 
their  deuterated  analogs  are  ^ndied  to  evaluate  aiy  pctsitial 
catalytic  effect  of  eaA  ccTuxxaid  cn  the  other  as  as  to  better 
understand  the  aechanifra  of  their  thenaal  deocuposition. 

ExperiBBital  Section  and  Data  Analysis 

Isothertal  differential  scanning  calariiaeter  es^erinents  were  conhicted 
on  a  teaddn-HTraer  DSC-7  systaa  connected  to  a  Betidn  Kinpr-  7500  Data 
Station.  Berkin-Klmer  large  voluce  c^isules  (FE  part  no.  031d*-0(^} 
were  used  for  all  ea^eriments.  Ihough  sene  esgseriments  were  ocsiductsd 
without  the  O-ring  seal,  the  numerical  data  r^xsrtsd  here  cooes  frosa 
c^sules  that  were  sealed  with  O-rings.  Samples  were  loaded  into  the 
ESC  head  at  50  °C  and  the  tenpaature  was  increased  at  the  rate  of 
200  °C/inin  to  the  selected  isothennal  temperature.  All  sample  sizes 
were  in  the  range  of  1.75-2.15  mg. 

Sanples  of  HJT,  lUr-DS,  NTO  and  NID-D2  were  prepared  during  the  sunsaer 
of  1990  by  Ted  Burtey^  and  had  been  stored  in  a  dessicator.  A 
melting  point  determination  of  the  KiO  and  INT-DS  indicated  that  the 
material  was  substantially  unchanged  over  one  year. 

Mixtures  that  were  studied  were  made  by  dissolving  tlie  desired 
quantities  of  each  cemponent  in  acetone,  once  dissolution  was 
ccsiplete,  a  stream  of  dry  nitrogen  vras  passed  into  the  flask  vmtil  the 
acetone  was  no  longer  visuaJly  evident.  Ihe  flask  was  then  plac^  in  a 
vacuum  dessicator  for  approximately  one  hour  to  remove  the  last  traces 
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of  acebone. 


Delta  ftxza  tiie  7500  Data  Station  vexB  fed  into  an  lEH  ocngatible  PC 
isacfcine  in  a  sin^  Ir/te  hegeadRciral  fotnat.  A  laacxD  progran  was 
written  in  QDKmRD  PRO  to  convert  the  data  to  dedunnl  fooet  and  allow 
for  further  calculation.  Due  to  nesory  constraints  in  the  PC,  sccie 
files  needed  to  be  reduced  in  size.  Ihis  was  acccnplished  by  reading 
in  every  other  point  or  every  third  point  depending  t:pon  the  nunher  of 
data  points  in  the  DSC  file. 

Results 

Sasple  Van  Configuration 

A  wide  variety  of  sanple  pan  configuratiens  are  available  for  use  with 
the  Perkin-ELmer  DSC-7.  Variables  include  material  of  constructioi, 
volume,  and  pressure  oentainment  or  release.  East  work^  had 
indicated  that  stainless  steel  large  volume  capsules  would  work  in  this 
stu^,  and  that  it  would  be  beneficial  to  seal  such  pans  with  a  Viton 
0-ring  to  contain  gaseous  products. 

Ihere  are  at  least  three  drawbacks  to  sealing  the  vessel.  The  first  is 
that  the  various  e3q)eriments  will  no  longer  be  conducted  at  the  same 
measurable  pressure.  Measurements  of  mass  loss  after  e}5)eriinents 
indicate  that  decesrposition  processes  for  NID  and  HTT  are  considerably 
different,  with  the  NTO  converting  approximately  59  ±  8%  of  its  mass  to 
gaseous  (SATP)  products.  INT,  on  the  other  hand,  only  converts  about 
14  +  1%  to  gaseous  products.  Ihus  the  pressure  contained  in  a  sealed 
pan  at  the  end  of  the  deccarposition  could  be  considerably  different  for 
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the  tuD  isaterials. 


A  second  draoback  to  sealing  the  vessels;  is  that  the  pjoduct  gases 
contained  during  deconKasition  could  have  a  catalytic  effect  on  the 
process.  This  catalytic  effect  could  alter  the  mechanism  of 
dBconposition  or  mask  the  true  decocposition  kinetics  in  the  ^-stem. 

A  third  dranback  to  sealing  the  vessels  is  that  the  O-ring  itself  could 
catalyze  the  decoopositim  process.  HTO  deconposes  in  the  solid  state 
at  appnoodmately  262  °C  (visual  verification  in  melting  point  block) , 
so  it  is  imlikely  that  the  O-ring,  v^ch  is  not  in  ccxitact  with  the 
sample,  vnuld  accelerate  the  ^no  deccopositian  process.  INF,  m  the 
other  hand,  melts  at  80  °C  and  apparently  condenses  to  a  large  extent 
on  the  ceiling  of  the  pan  (visual  verificaticxi  after  ©g)eriroents)  and 
could  thus  have  an  opportunity  to  come  in  contact  with  the  O-ring. 

In  order  to  investigate  these  effects,  ejgjeriments  were  ccaiducted  using 
cells  that  were  sealed,  not  sealed,  or  only  partially  sealed.  For  the 
partially  sealed  cells,  approximately  one-fourth  of  the  O-ring  was 
removed  and  the  stainless  steel  pan  was  "sealed”  with  the  remaining 
three-fourths  O-ring.  Ihe  peak  time  for  the  INT  at  257  °C  without 
O-rJjig  was  considerably  delayed  carpared  to  the  sealed  vessel  (30  min. 
conpared  to  15  min. ) .  In  addition,  the  unsealed  vessel  gave  a  less 
well  defined  and  reproducible  exotherm  than  the  sealed.  Mass  loss 
measurements  indicate  that  the  stainless  steel  pans  do  not  seal 
consistently  if  the  O-ring  is  emitted.  Inclusion  of  a  three-fourths 


30-5 


O-ring  in  the  vessd  gcive  esssitially  tsro  excthexss  -  a  siiaip  am 
iBmecjjatdy  hitting  257  and  a  xuch  scalier  delayed  one  that 
oocre^xxids  affnxodjoately  to  the  large  exotherm  of  the  sealed  system. 
Addition  of  Oring  bits  to  the  sample  pan  without  an  O-ring  seal  led  to 
results  that  were  very  similar  to  the  first  exotherm  in  the  partially 
sealed  system.  It  ^gpears  that  the  IMT  deocsposition  is  catalyzed  ly 
contact  with  the  O-ring. 

rSealing  of  the  vessel  ccxitciining  KTO  with  a  Vital  O-ring  also 
apparently  catalyzes  its  thermal  deconpositicai  relative  to  the  unsealed 
vessel.  Oantaimnent  of  the  pressure  leads  to  a  sharper  more 
r^roducible  exotherm.  Mass  loss  measurements  indicate  that  virtually 
all  of  the  gaseous  products  escape  during  the  run  if  the  O-ring  seal  is 
ondt^.  A  partially  sealed  vessel  results  in  a  thermogram  quite 
similar  to  the  unsealed  vessel.  Addition  of  O-ring  bits  to  the  sample 
pan  produced  no  visible  exotherm  after  40  min  at  235  °C  in  an 
unsealed  vessel,  ^Mle  NTO  deccstposed  in  20  minutes  in  an  O-ring  sealed 
vessel  at  the  same  temperature  Biis  indicates  that  the  containment  of 
the  product  gasses  or  the  increased  pressure  is  more  likely  catalyzing 
the  NTO  deccjposition  than  the  presence  of  the  O-ring. 

The  experiments  conducted  here  were  conducted  in  vessels  sealed  with 
0-rings  due  to  the  better  reproducibility  and  the  large  amount  of 
gaseous  products  formed,  particularly  in  the  case  of  NTO.  Activation 
parameters  determined  will  reflect  the  catalyzed  process  and  may  not  be 
readily  transferable  to  the  uncatalyzed  processes.  However,  the 
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zelative  activaticn  parameters  may  ^led  considerable  on  the 

interaction  of  NID  and  INF  and  their  deuterated  analogs  during  the 
thermal  deocnpositicn  process. 

General  Description  of  Analysis 

Figure  1  illustrates  the  general  appearance  of  all  IDSC  thennograms 
obtained  in  this  stu^.  Ihere  are  two  exotherms  evideiit  -  a  small  csie 
early  in  time  that  generally  initiates  inmediately  vpm  reaching  the 
chosen  tenperature  and  a  secxsnd  larger  one  that  follows. 

Biou^  there  cxjuld  be  seme  interesting  aspects  to  the  first  exotherm, 
tenperatures  used  in  this  study  were  not  lew  enou^  to  observe  the 
vhoie  exotherm.  One  way  of  approximating  kinetic  ciata  fixsn  the  first 
peak  is  to  record  an  induction  time.  Biis  corresponds  to  the  hi^  heat 
flew  point  between  the  first  and  second  exotherm  (point  A  in  Figure 
1) .  Uiis  induction  time  corresponds  rou^ily  to  the  time  of  completion 
of  the  first  exothermic  process.  Biis  set  of  data  is  referred  to  as 
induction  time  data'  in  the  following  discussion. 

Ihe  second  exotherm  allows  for  a  more  conplete  kinetic  analysis.  The 
0)APrRD  FRO  program  produced  the  three  plots  given  in  Figure  2  from 
vhich  the  quantitative  analysis  proceeded. 

Ihe  first  plot  includes  the  data,  the  selected  baseline  and  the  natural 
logarithm  of  the  baseline  minus  the  heat  flow. 
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Hie  second  plot  ocnpares  an  aubocatalytic  treatment  of  the  sheeted 
pe^  vith  a  first  order  treatment.  Rogers^  has  ^lown  that,  for  a 
first  order  reaction,  the  In  b,  vhere  b  is  the  heat  flow.  Is  linearly 
r^ted  to  the  time.  Inplicit  in  this  derivaticxi  is  also  the  fact  lhat 
for  a  first  order  reacticsi  ~ln{l-«^  (where  o<.  is  the  degree  of 
ccawersicxi)  must  be  linearly  related  to  time.  The  first  two  plots 
allow  one  to  ocepare  the  linearity  of  In  b  (fixm  the  first  plot)  with 
the  linearity  of  -ln(l-o<)  (from  the  second  plot) .  If  both  sets  of 
data  are  not  linear  in  the  same  region,  the  reaction  is  most  likely  not 
first  order.  In  this  study,  there  did  not  appear  to  be  substantial 
first  order  bdiavior  in  any  of  the  eperiments. 

Hie  second  curve  of  the  second  plot  illustrates  the  behavior  of  l~c< 
versus  time.  In  aubocatalytic  reaction,  this  plot  will  assume  a 
characteristic  decreasing  sigmoidal  ^pe. 

Hie  third  plot  of  Figure  2  allows  one  to  calculate  the  aubocatalytic 
rate  constant.  An  aubocatalytic  reaction  will  render  a  significant 
linear  portion  in  this  plot^  with  a  slope  of  k.  All  of  the 
experiments  conducted  here  were  treated  with  autocatalytic  kinetics. 

In  every  case,  the  linear  portion  extended  at  least  fresn  (l-«>^)  values 
of  0.06  to  0.17  which  corresponds  to  of  0.065  to  0.22. 
iwr  and  Iwr-D3: 

Figure  3  gives  tlie  Arrhenius  plots  for  INT  and  INT-DS  vhere  k  comes 
from  the  autocatalytic  treatment  (Figure  3a)  and  vhere  k  is  siitply  the 
reciprexal  of  the  induction  time  (Figure  3b) .  Autocatalytic  treatment 
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of  the  data  leads  to  an  activaticsi  energy  (E^)  for  INT  decxsiposition 
of  27^5  it  0.8  taal/mol  and  for  INr-Da  of  23  +  2  Jocal/mol.  Omission  of 
the  two  hitler  tenperature  points  frcni  TNT-DS  leads  to  an  of  27  i. 

1  Jaal/mol,  in  close  agreement  with  the  HIT.  Ocnpariscxi  of  the 
inducticai  time  activation  energies  leads  to  similar  activation  energies 
again,  22.0  JdO.6  fccal/mol  for  TKT  and  22  i  1  kcal/mol  for  TNIVDS. 
Activation  energies  obtained  frcm  the  aiitocatalytic  treatment  are  very 
similar  to  those  found  in  past  vrork  conducted  with  unsealed  aluminum 
pans.^ 

The  average  isotopic  ratio  <±tained  in  this  work  is  1.11  i.  0.05  for 
autocatalytic  treatment  and  1.11  +  0.09  for  induction  time  treatment 
over  the  tenperature  range  studied  (245-265°C) .  Hus  contrasts  quite 
sharply  with  values  found  earlier^  of  1.35  +  0.02  and  1.6  jt  0.02, 
respectively.  Hie  major  differences  in  the  studies  center  on  the 
sanple  pan  configuration.  Hie  alimdnura  pans  used  in  the  earlier  stucfy 
will  not  contain  pressure  above  ambient,  while  the  stainless  steel  pans 
used  in  this  study  will  contain  v;p  to  24  atmospheres  v^en  sealed  with  a 
rubber  O-ring.  Hie  O-ring  itself  introduces  another  factor  as  noted 
earlier,  the  possibility  of  catalytic  action. 

As  was  noted  earlier,  the  HIT  decortposition  seems  to  be  catalyzed  by 
the  presence  of  the  O-ring.  A  question  arises  as  to  vhether  the 
catalytic  effect  of  the  O-rings  dilutes  or  even  removes  the  isotope 
effect.  A  conparison  of  the  past  work  and  current  work  suggests  that 
there  is  such  an  effect.  A  comparison  of  HIT  and  H1T-D3  runs  without 
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0-rings  at  250  °C  gives  and  inducticjn  time  ratio  of  3.12,  much 
different  than  the  values  obtained  ;*en  0-rings  are  present.  Ifess  loss 
measurements  indicate  that  loss  of  product  gasses  is  not  a  prctolem  even 
in  the  absence  of  the  0-ring.  Presence  of  the  0-ring  apparently  alters 
the  mechanism  or  speeds  the  rate  enou^  that  the  isotope  effect  either 
becomes  secondary  in  nature  or  is  diluted  in  magnitude. 

NTO  and  Wr0-D2 

Figure  4  gives  the  Arrhenius  plots  for  NTO  and  N1X>-D2  where  k  comes 
from  the  autocatalytic  treatment  (Figure  4a)  and  vhere  k  is  siirply  the 
reciprocal  of  the  induction  time  (Figure  4b) .  Ihe  autocatalytic  plots 
are  scattered,  particularly  for  NID-D2.  It  is  difficult,  based  1:5x30 
the  data  presented,  to  arrive  at  a  reasonable  linear  fit.  The  induction 
time  plot  is,  however,  much  inproved  over  the  autocatalytic  treatment 
and  leads  to  activation  energies  of  47.5  +  1.1  and  53.6  ±  3.5  kcal/mol 
for  NTO  and  NIO-D2,  respectively.  Ihese  values  are  considerably  lower 
than  earlier  work^  vhich  produced  activation  energies  of  87.5  +  1.8 
and  88.4  ±.  1.2  kcal/mol  for  NTO  and  NT0-D2,  respectively.  There  is  the 
possibility  that  the  stainless  steel  catalyzes  the  NTO  deconposition 
relative  to  the  glass  used  in  the  other  stucfy  or  that  confinement  of 
produc±  gasses  is  affecting  the  deconposition. 

Isot(p)ic  ratios  obtained  in  this  work  seem  to  decrease  rather 
dramatically  as  tenperature  increases  over  the  limited  tenperature 
range  investigated  (230-243  °C) .  Due  to  the  substantially  higher 
activation  energy  of  NTO  over  TNT,  the  tenperature  region  over  which 
useful  data  may  be  obtained  is  limited.  At  low  tenperature  (220-230 
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°C) ,  thermograins  are  erratic  and  not  generally  ri^roducible.  At 
hi^er  tenperatures  (above  245  °C) ,  the  reaction  initiates  before  the 
DSC  has  stabilized.  It  is  difficult  to  say  from  this  work 
unequivocally  vAiether  the  isotope  effect  is  primary  or  secondary  for 
NID  decomposition  under  these  conditions. 

TOTT/Wro  Mixtures 

Figure  5  gives  Arrhenius  plots  for  the  thermal  decomposition  of  INT  and 
NTO  mixtures  vhere  data  is  again  extracted  from  the  second  major 
exotherm.  There  is  apparently  no  large  effect  due  to  the  isotqpic 
substitutions  indicated  in  the  figure.  Activation  energies  for  the  36 
mol  %  TNT  samples  range  from  60  2  kcal/mol  for  TNT/NID  to  65  ^  1 

kcal/mol  for  the  TNr-D3/NID-D2  mixtures.  Isotopic  ratios  range  from 
1.42  at  220  °C  to  1.18  at  235  °C.  The  activation  energy  for  the 
decomposition  is  in  the  nei^iborhood  of  the  NTO  activation  energy,  and 
the  peak  area  (average  1120  J/g)  is  in  the  vicinity  ejpected  for  a  36 
mol%  TNT  mixture  (885  J/g) .  The  second  exotherm  in  these  mixtures  may 
correspond  to  the  deccmoposition  of  excess  NTO.  The  TNT  decomposition 
apparently  corresponds  to  the  first  exotherm  observed. 

It  is  of  interest  to  determine  at  vhich  composition  of  mixture  the 
second  exotherm  is  no  longer  evident.  Data  taken  with  a  1.9  mol  %  NTO 
mixture  suggest  that  NTO  decomposition  at  this  level  is  either  not 
within  instrumental  limits  of  detection  or  that  the  small  amount  of  NID 
is  decomposed  in  the  early  exotherm.  NTO  levels  were  increased  to  17, 
33,  50,  and  64  mol  %.  Figure  6  shows  the  results  of  the  ensuing 
thermograms  at  235  °C.  It  appears  that  the  NTO  concentration  beyond 
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the  33  mol  %  level  results  in  excess  NTO  remaining  after  catalyzing  the 
TNT  decomjposition. 

Conclusions 

Isotope  effects  previously  seen  in  HIT  decorpositions  using  IDSC  were 
not  as  prcaninent  in  this  work,  i^parently  the  presence  of  the  Viton 
0-ring  catalyzes  the  HJT  decorpcsition  and  dilutes  the  isotope  effect 
or  alters  the  mechanism  sufficiently  to  change  the  rate  determining 
sb^. 

The  thermal  decarposition  of  NTO  can  be  followed  ly  IDSC  over  only  a 
very  narrcw  range  of  tenperature  <&ie  to  its  hi^er  activation  energy 
than  that  of  INT.  Results  suggest  a  possible  isotcpe  effect  that  is 
strongly  tenperature  dependent  in  0-ring  sealed  stainless  steel  pans. 
Ihe  deconposition  of  the  NTO  is  catalyzed  by  the  confinement  of  its 
products  gasses. 

Mixture  studies  of  INT  and  NIC  suggest  that  a  33  mol  %  mixture  of  NIC 
is  sufficient  to  catalyze  the  deccsrposition  of  TNT.  Levels  of  NTO  in 
excess  of  that  result  in  excess  NTO  remaining  after  TNT  deconposition. 
Conputer  calculations  have  indicated  that  the  energies  of  the  two  N-H 
bonds  on  NTO  are  similar^®,  so  it  may  be  possible  that  two  TNT 
molecules  can  attack  each  NTO. 

A  method  that  could  be  used  to  better  study  the  thermal  decarposition 
of  NTO  is  thermogravimetric  analysis  (TGA) .  The  NTO  decatpc.sition 
produces  about  G0%  of  its  mass  as  gaseous  products  at  SATP,  tut  the 
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deccKpsition  agiaKently  occurs  Ib  tiie  solid  tyasB.  Also,  sauce  tise 
ICA  does  not  have  the  settling  t±Qe  that  is  a  pccideBB  in  OGC,  the 
teqperaltge  range  to  he  studied  could  pahaps  be  epanded.  ISAwould 
also  allotf  ppodact  garses  to  be  suEpt  acay,  zenoidng  the  possibility  of 
catalytic  actixxi  of  these  species. 
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figure  2a 


/vrO  Isothermal  at 235  C 
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Figure  2fa 


NTO  Isothermal  at  235  C 
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Autocatalytic  Plot  for  TNT  and  TNT-D3 


»  TNT  +  TNT-D3 


Figure  3a 


Induction  Time  Plot  for  TNT  and  TNT-D3 


m  TNT  +  TNT-D3 
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Autocatalytic  Plot  for  NTO  and  NT0-D2 


-3.5 


*6.5-! - 1 - r 

-1.01  -1.005  -1 


■0.995  -0.99  -0.985  -0.98  -0.975  -0.97 
-1JRT  (xkcal/mol) 


■  NTO  +  NT0-D2 


Figure  4a 


Induction  Time  Plot  for  NTO  and  NT0-D2 
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Autocatalytic  Plot  for  TNT  INTO  Mixtures 
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TERNARY  PHASE  DIAGRAM  OF  MEIC/Naa/Aiaj 
Dr  Do  Ren  Cliang 

Chemistry  Department,  Averett  College,  Danville,  VA  24541 


Abstract: 

Phase  diagram  of  MEIC/NaCl/AlCla  ternary  system  for  solid-liquid  transitions  has 
been  investigated  extensively  using  DSC.  A  contour  map  of  isotherms  has  been 
constructed  from  these  measurements.  It  reveals  that  the  vast  domain  of  this 
system  is  liquid  at  room  temperature  and  along  0.5  mole  fraction  of  AICI3  line 
parallel  to  the  MEIC/NaCl  binary  is  the  ridge  of  the  system.  The  MEIC/NaCl 
binary  phase  diagram  has  been  studied  also  for  the  completeness.  A  new  compound 
is  formed  with  MEIC/NaCl  in  2:1  ratio. 

Introduction: 

Molten  salts  as  electrolytes  have  been  investigated  extensively  for  battery 
applications  [1].  Some  molten  salts  possess  a  wide  electrochemical  window  which 
may  lead  to  a  battery  with  high  potential  and  high  theoretical  specific  energy,  thus 
it  may  be  operated  at  high  cunent  densities  without  too  high  power  losses  because 
of  the  high  conductivity  of  molten  salts.  However,  most  of  the  available  molten 
salts  batteries  are  operated  at  significant  high  temperature  due  to  the  electrolyte 
used.  A  molten  salt  system  consists  of  MEIC  (l-methyl-3-ethylimidazolium 
chloride)  and  AICI3  remaining  in  liquid  at  room  temperature  was  first  predicted  and 
synthesized  at  FJSRL  [2,  3].  These  chloroaluminate  melts  not  only  have  a  wide 
electrochemical  window  but  also  are  liquid  at  room  temperature  between  N=0.Z  to 
0.7,  where  N  is  the  mole  fraction  of  AICI3  [4].  For  N<0.5  are  called  basic  melts. 
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The  main  ionic  spedes  in  this  le^on  are  MEI%  Cl'  and  AICI4';  and  i\r>0.5  are  called 
addic  melts,  besides  MEP,  the  dominating  anion  spedes  are  AlCl4~  AI2CI7'. 
The  AlaClio'  appear  when  N>0.75  [5].  For  N=0.5  is  called  a  neutral  melt  and  has 
been  shown  experimentally  [6]  to  have  the  widest  electrochemical  window  of 
approximately  of  4.4  V.  However,  once  an  electrochemical  reaction  takes  place  in 
the  cell,  the  electrolyte  moves  away  from  neutrality.  Recently,  NaCl  was  discovered 
to  have  bufrer  action  in  the  neutral  melts  [7].  Relatively  little  is  known  about  this 
ternary  system  of  MElC/NaCl/AlCls.  This  report  details  the  phase  behavior  of  this 
system  in  solid-liquid  transition  region.  Along  with  the  binary  MEIC/NaCl 
presented  here,  together  MEIC/AICI3  [4]  and  NaCl/AlCls  [8,  9]  gave  a  complete 
picture  of  the  ternary. 

Experiment: 

(1)  Melts  preparation:  MEIC  and  aluminum  chloride  are  prepared  and  purified  as 
described  in  reference  2.  Sodium  chloride  with  99.999%  purity  is  purchased  from 
Aldrich  and  is  used  directly  without  any  treatment.  All  mixtures  are  prepared  and 
weighed  inside  the  glovebox  under  helium. 

Appropriate  different  amounts  of  MEIC  and  NaCi  are  mixed  first,  then  brought  to 
melt  under  lOQoC,  cooled  back  to  solid  and  then  grinded  into  powder.  These  binary 
compositions  are  then  u'  ’  ^uepare  ternary  compositions  by  adding  different 
amounts  of  AICI3.  Six  binary  series  are  selected  in  this  study.  They  are  A,  C,  E,  G, 
I  and  K.  corresponding  to  mole  fraction  of  NaCl  in  MEIC/NaCl  binary  of  about 
0.1,  0.3,  0.4,  0.5,  0.6  and  0.7  respectively.  When  AICI3  is  added  to  each  series,  some 
form  gel,  some  form  clear  liquid  and  some  have  undissolved  excess  AICI3  at  room 
temperature.  Small  heat  is  applied  (less  than  50°  in  temperature)  to 
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help  dissolve  and  mix.  Since  AICI3  is  highly  volatile,  tempeiatnie  above  60°C  is 
undesirable  to  avoid  the  change  of  composition  in  the  sample.  Some  other  binary 
N?fn /AICI3  have  also  been  used  to  prepare  the  ternary  system  in  this  study. 

(2)  Measurement: 

For  visual  observation,  samples  are  loaded  into  the  capillary  tube,  the  open  end  of 
the  tube  is  then  temporarily  sealed  with  paraffin  film,  then  moved  outside  the 
glovebox  and  quickly  sealed  by  flame.  Melting  point  is  observed  with  Meltemp 
device  if  the  melting  point  is  greater  than  the  room  temperature.  For  DSC 
measurement,  about  10  mg  samples  are  sealed  in  a  steel  pan  fitted  with  an  o-ring 
cover  pan.  Perkin-Elmer  7  series  is  used.  This  includes  DSC-7,  TAC-7 
Instrument  Controller,  Perkin-Elmer  7500  professional  computer,  Perkin-Elmer 
graphics  plotter  and  an  intracooler  with  the  capability  to  lower  the  temperature  to 
— 70oC.  Samples  are  scanned  at  a  fixed  rate  of  5o/min.  AU  binary  samples  showed 
an  endo  peak  in  the  DSC  run.  The  results  are  shown  in  Table  1  and  Figure  1.  A 
typical  DSC  run  for  a  ternary  sample  can  have  one  exo  and  several  endo  peaks,  or 
one  exo  and  one  endo  peak.  If  no  peak  appears  this  means  that  the  melting  point  or 
glass  transition  temperature  is  very  low,  beyond  the  ability  of  the  intracooler.  In 
such  a  situation,  the  melting  point  is  carried  out  with  the  following  set  up  as  shown 
in  Figure  2.  For  viewing  at  subambient  temperature,  the  outside  air  jacket  needs  to 
be  sprayed  with  methanol  occasionally.  The  results  for  the  ternary  are  shown  in 
Table  2. 

Result  and  Discussion: 

For  MEIC/NaCl  binary,  in  general,  the  visual  observation  differs  only  slightly  from 
DSC  values.  For  a  wide  range  of  compositions,  the  melting  points  varied  slightly. 
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This  unexpected  result  in(hcates  that  the  liquid  melts  are  nearly  pure  MEIC,  that  is 
the  very  low  solubility  of  NaQ.  The  existing  local  maximum  at  33  mol  %  NaQ 
reveals  a  compound  formation  of  2  to  1  in  MEIC  to  NaCl,  with  chemical  formula 
(MEI)2NaCl3.  As  shown  in  Figure  1,  there  exists  an  eutectic  composition  at  60  mol 
%  NaG  and  melts  at  85°C.  For  ternary  MEIC/NaG/AlCls  system,  since 
experiments  are  carried  out  mostly  in  such  a  way  from  six  preselected  binary  series, 
then  by  adding  AICI3  to  each  series.  Thus,  in  the  ternary  phase  diagram  as  shown 
in  Figure  3,  these  paths  correspond  to  by  connecting  points  A,  C,  E,  G,  I  and  K  on 
the  MEIC/NaCl  side  to  the  apex  AICI3  respectively.  Along  a  particular  path,  the 
ratio  of  MEIC  to  NaCl  is  fixed.  The  melting  point  will  change  according  to 
different  amounts  of  AICI3  being  added.  The  results  are  shown  in  Table  2.  As  an 
illustration  for  this  G-series,  the  exo  peak  corresponds  most  likely  to  a  glass  to 
crystalline  solid  transition  and  the  endo  peaks  correspond  to  the  solid-solid  phase 
transitions  or  to  the  solid-liquid  transition  if  the  endo  peak  occurs  at  the  highest 
temperature.  Based  on  these  interpretations,  the  phase  diagram  along  the  G-series 
is  constructed  as  shown  in  Figure  4.  A  local  maximum  is  called  peritectic  point  in 
ternary  corresponds  at  the  following  compositions  of  mole  fraction  AICI3  0.5,  MEIC 
0.25  and  NaCl  0.25  melts  at  temperature  36oC.  It  is  a  new  compound 
(MEI)Na(AlCl4)2  and  a  double  salt  of  MEI  AICI4  and  NaAlCU.  There  exists  also 
another  less  pronounced  peritectic  point  and  several  eutectic  points  of  ternary. 
Most  of  them  below  the  room  temperature.  If  the  peritectic  point  is  approached 
from  both  sides  of  the  G-line,  the  slope  is  rather  steep.  By  combining  all  the  data 
studied  thus  far,  it  showed  that  a  maximum  exists  always  at  0.5  mole  fraction  in 
AICI3  in  this  ternary  system.  Along  this  0.5  mole  fraction  AICI3  line  MN  parallel  to 
the  MEIC/NaCl  binary  side  in  Figure  3,  the  ternary  system  can  be  regarded  as  a 
pseudo-binary  of  MEI  AlCU/NaAlCU.  The  melting  point  of  this  pseudo-binary 
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has  been  obtained  [10].  The  present  result  along  the  MN  line  is  shown  in  Figure  5 
for  the  comparison  with  the  reference  10.  The  results  confirm  the  general  features  of 
the  pseudo-binary  and  this  is  a  ridge  line  in  the  ternary  system.  As  the  peritectic 
point  is  approached  from  both  sides  of  the  MN  line,  the  slope  is  much  less  steep 
than  along  the  G-line.  This  indicates  that  a  strong  dependence  on  the  AICI3 
component  in  the  ternary  system.  This  is  understandable  due  to  chemical  reactions 
occuring  between  AICI3  and  MEIC  and  NaCl  respectively.  The  agreement  between 
the  two  results  is  remarkable. 

The  mapping  of  isotherms  can  be  done  in  the  ternary  system  by  incorporating  all 
three  binaries  involved.  As  shown  in  Figure  3,  the  contour  lirts  of  the  melting  point 
are  made  for  20°  intervals  for  clarity  in  high  temperature  range  and  smooth 
interpolation  can  be  made  for  intermediate  temperature  in  low  temperature  domain. 
The  shaded  area  shown  in  the  figure  has  not  been  investigated  due  to  excess 
insoluble  components  that  exist  at  the  preparation  of  the  sample. 

Conclusion:  It  reveals  a  new  compound  (MEI)2NaCl3  formed  in  binary  MEIC/NaCl 
and  a  new  compound  (MEI)Na(AlCi4)2  in  the  ternary  system.  The  vast  domain  of 
this  ternary  is  in  liquid  form  at  room  temperature.  It  is  an  excellent  candidate  as 
an  electrolyte  for  molten  salt  battery  system.  For  buffered  neutral  melts  is 
concerned,  a  wide  range  of  NaCl  can  be  used. 
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TaSdsl 


Mole  &acticn  of 

Matins 

Pciat  fcCl 

Sample 

NaQ  in  the  mdt 

Vlsiaai 

DSCtmset 

A 

0.1032 

86.0 

86.50 

B 

0.2021 

85.S 

86.49 

C 

0.3165 

87.0 

86.73 

D 

0.3556 

86.6 

87.06 ' 

E 

0.3568 

86.6 

86.74- 

F 

0.4601 

86.3 

86J9 

G 

0.4559 

86.3 

86.43 

H 

0.3315 

86.0 

8o.SS 

I 

0.6038 

85.4 

85.01 

J 

0.6520 

85.0 

85.79 

K 

0.7C67 

87.2 

85.09 

L 

0.7541 

87.3 

85.37 

M 

0.8G57 

86.5 

86.52 

N 

0.5020 

87.0* 

85.37* 

0 

0.9508 

86.3* 

84.88* 

P 

0.8357 

86.6* 

85.61* 

MEIC 

0 

87.0 

8i.34 

*Samples  N,  0  and  P  are  not  completely  melted  under  ICflcC  during  ther  preparation,  but 
were  ground  to  mix  well.  Under  visual  observation,  these  samples  appear  as  wet  at  their 
respective  temperatures-  These  are  taken  as  a  sign  of  a  phase  transition,  even  though  the 
entire  sample  remains  as  solid  unto  245^0  and  some  decomposition  is  shown  by  brownish 
color. 
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MEIC/NaCI  Phase  Diagram 
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MElC/NaCl  binary  phase  diagram 
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figure  2.  Mt;lbii\g  point  fiovice 
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Table  2.  Welting  Points  of  the  WEIC/NaCl/AlCl j  Ternary 
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0.65  -94b  -44.683a,  4.772  -39.858®,  1.694  -33.076a,  -2.043 

0.70  87.5b  -49.066®,  5.730  -39.343®,  -0.441  -20.0®,  1.964 

exo  peak,  involving  glass  to  crystalline  transition  likely 
visual  observation 


ImAICI,  NaAICI 


MOLE  FRACTION 


PHOTO-ELECTRONIC  NONLINEAR 
EXCITATIONS  AND  WAVE  PROPAGATION  IN 
PERIODICALLY  MODULATED  MEDIA 

Dr.  Marek  Grabowski* 

Frank  J.  Seiler  Research  Laboratory,  US  Air  Force  Academy,  CO  80840-6528 

The  propagation  of  electronic  and  electromagnetic  waves  in  periodically  modulated, 
nonlinear  media  is  studied  in  terms  of  a  discrete  dynamical  system.  The  analysis  of  this 
dynamical  system  is  applied  to  the  problems  of  tunneling  of  carriers  in  one-dimensional 
arrays  of  semimagnetic  quantum  dots,  nonlinear  photo-electronic  excitations  in  saturable, 
dispersive  and  absorptive  periodic  medium,  and  bichromatic  optical  wave  propagation  in 
electrically  poled  waveguides.  New  results  include  the  intensity  dependent  transmission 
coefficient,  photonic  bands  with  intensity  dependent  stop  gaps,  coherent  soliton  like 
states  within  forbidden  states  of  linear  theory,  and  phase-matched  second  harmonic 
generation  in  inversion  symmetric  systems. 


*  permanent  address:  Department  of  Physics,  University  ofColorr^lo, 
Colorado  Springs,  CO  80933 
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1.  INTRODUCTION 


There  are  many  very  interesting  phenomena  associated  with  the  propagation  of 
electromagnetic  and  electronic  waves  in  spatially  periodic,  nonlinear  media.  Recently,  a 
number  of  these  phenomena  have  been  studied.  In  particular,  Delyon  et  aO  have  shown 
that  a  periodic  modulation  of  a  one-dimensional  nonlinear  medium  induces  multistability 
in  the  transmitted  wave  intensity,  while  Mills  and  Trullinger^  have  examined  the 
localized,  solitonlike  structures  associated  with  these  systems.  We  have  also  studied  these 
phenomena  from  the  point  of  view  of  applications  to  the  ballistic  transport  of  electrons  in 
semiconductor  superlattices^,  nonlinear  transmission  of  light  through  multiple  quantum 
well  systems^  and  universal  dynamics  of  a  discrete  Hamiltonian  map  describing  these 
systems^.  Here  we  shall  concentrate  our  attention  on  the  photo-electronic  excitations  and 
intensity-dependent  electromagnetic  wave  propagation  in  the  presence  of  externally 
imposed  spatial  periodicity  of  the  medium. 

Specifically,  we  first  investigate  the  transmission  of  waves  via  tunneling  of  carriers 
in  quasi-one-dimensional  arrays  of  semimagnetic  quantum  dots  immersed  in  a 
nonmagnetic  material.  This  quantum  wire  system  is  modeled  by  the  appropriately  derived 
Schrodinger  equation,  which  is  then  transformed  into  a  nonlinear  complex  map  relating 
wavefunctions  of  the  electrons  on  successive  layers.  The  analysis  of  this  map  is  then 
performed  according  to  the  methods  previously  described^. 

Subsequently,  we  study  the  tunneling  problem  for  the  periodically  modulated  wire. 
We  find  that  the  large  magnetopolaron  (free  carriers  dressed  by  the  induced  polarization 
of  the  magnetic  ions)  effects  due  to  the  exchange  interaction  with  magnetic  ions  result  in 
a  transmission  via  coherent,  solitonlike  electronic  states.  The  calculated  transmission 
coefficient  shows  the  opening  of  intensity  dependent  gaps  in  the  transmission  spectrum. 
The  experimental  tunneling  studies  of,  for  example,  CdMnTe  quantum  wires  should 
demonstrate  the  here  predicted  effects  of  multistability,  hysteresis,  and  the  opening  of 
stop  gaps  in  the  energy  spectrum. 
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The  research  relevant  to  this  pan  is  described  in  more  detail  in  the  paper  entitled 
“Tunneling  in  a  Periodic  Array  of  Sendmagnenc  Quantum  Dots”  which  has  been 
submitted  for  publication  in  Physical  Review  Letters  and  its  manuscript  is  included  in 
this  repon  as  the  Appendix  I. 

It  should  be  noted  here  that  although  this  paper  deals  specifically  with 
semimagnetic  electronic  systems,  the  problems  investigated  there  are  more  general  in 
nature  and  are  relevant  to  a  wide  variety  of  systems,  including  light  propagation  in  optical 
media  with  periodically  varying  dielectric  constant.  The  effort  in  the  direction  of  these 
kinds  of  applications  is  presently  underway. 

The  studies  so  far  were  based  on  two  basic  assumptions:  the  modulation  of  the 
dielectric  medium  is  on  the  scale  of  the  wavelength  of  the  propagating  wave  which  is 
typically  much  greater  than  the  length  scale  associated  with  both  individual  atoms  and 
their  separation,  and  the  propagating  wave  is  strictly  monochromatic.  Therefore,  a 
classical  description  of  the  medium  in  terms  of  its  polpxizability  entering  a  single  wave 
equation  was  sufficient®.  However,  if  one  considers  the  interaction  of  individual  atoms 
with  the  field,  the  quantum  mechanical  description  of  this  interaction  is  necessary"^. 
Consequently,  in  the  second  part  of  this  research  project  we  make  the  first  step  towards 
lifting  the  first  of  the  above  assumptions. 

The  first  principle  study  of  nonlinear  photo-electronic  excitations  (polaritons)  in 
saturable,  dispersive  and  absorptive  medium  necessitates  a  self-consistent  treatment  of 
the  propagating  field  and  the  quantum  medium.  The  so-called  Maxwell-Bloch  equations^ 
provide  a  full  description  of  thp  spatial  and  temporal  behavior  of  this  coupled  system.  We 
investigate  the  transmission  of  photons  through  a  one-dimensional  array  of  polarizable 
quantum  dots  using  these  equations.  This  approach  allows  the  study  of  transient  effects 
which  are  crucial  for  optical  pulses  shorter  than  the  relaxation  times  of  the  medium.  Here, 
however,  we  have  left  the  temporal  behavior  in  the  coherent  regime  to  a  subsequent 
projects  and  have  concentrated  on  spatial  behavior  of  relatively  long  pulses. 


32-3 


The  Maxwell-Bloch  equations  for  the  quantum  wire  are  solved  in  the  steady  state 
regime  by  a  suitable  reduction  to  a  three-dimensional  discrete  map^.  This  nonlinear  map 
admits  stable  linearized  solutions  in  the  form  of  polariton  bands  as  well  as  nonlinear, 
coherent  solutions  corresponding  to  solitons  and  solitonic  trains.  The  transmission 
properties  of  the  system  are  studied  via  the  threshold  transmission  coefficient  and  they 
exhibit  transmitting  solitonlike  states  in  the  polariton  gaps  as  well  as  stop  gaps  in  the 
polariton  spectrum.  These  solitons  and  polaiitons  can  be  manipulated  by  separately 
contacting  individual  dots  giving  rise  to  a  very  exciting  possibility  of  a  tunable  optical 
system. 

More  details  of  this  research  project  are  given  in  the  paper  entitled  "Nonlinear 
Polariton  Excitations  in  Quantum  Dot  Arrays",  submitted  for  publication  in  Physical 
Review  Letters,  and  en^dosed  here  as  the  Appendix  II. 

Again,  let  us  remark  that  the  interesting  results  contained  in  this  part  of  research  are 
not  limited  to  semiconductor  quantum  wires,  but  are  also  applicable  to  the  problems  of 
waves  propagation  in  optical  media.  The  future  research  in  this  area  shall  focus  on  these 
applications  and  should  be  extended  into  the  full  spatio-temporal  regime  of  transient 
effects  relevant  to  the  propagation  of  short  optical  pulses. 

Finally,  the  last  project  which,  although  not  yet  completed  at  the  present  time,  will 
be  continued  in  the  future,  dealt  with  the  attempt  at  lifting  the  second  approximation 
mentioned  above.  The  simultaneous  propagation  of  two  waves:  th’  fundamental  and  its 
second  harmonic,  was  investigated  in  the  presence  of  an  external,  periodic  dc-field.  This 
situation  is  particularly  relevant  to  the  probler  j  of  efficient  second  harmonic  generation  in 
the  inversion  symmetric  media  such  as  optical  fibers*  or  possibly  thin  dielectric  films 
acting  as  optical  w'aveguides  and  periodically  f  oled  by  a  static  electric  field.  The  research 
in  this  area  is  being  continutxl. 
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abstract 

We  investigate  the  tunneling  of  carriers  in  a  quasi  one  dimensional  array  of  semimagnetic 
quantum  dots.  The  large  magnetopolaron  effects  due  to  the  exchange  interaction  of  carriers 
with  magnetic  ions  result  in  a  transmission  via  soliton-like  electronic  states.  The  intensity 
dependent  transmission  coefficient  shows  the  opening  of  intensity  dependent  gaps  in  the 

transmission  spectrum. 
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INTRODUCTION. 


Magnetopolaron  effects  in  the  bulk  semimagnetic  semiconductors  such  as 
Cdi-yMnyTe  and  quantum  wells  are  well  docuraentedUl,  Magnetopolarons  are  ffee  carriers 
dressed  oy  the  induced  magnetic  polarization  field  of  the  magnetic  ions.  Polaronic  effects 
are  weak  in  three  dimensions  and  can  only  be  observed  with  carriers  bound  to  impurities. 

In  quantum  wells  their  stability  is  marginal  In  one  dimensional  systems  these  polaronic 
effects  should  be  strong  and  lead  to  localized,  soliton-like  statesl^l.  The  strength  of 
polaronic  effects  should  increase  dramatically  in  quannim  dots.  We  consider  here  an  array 
of  coupled  quanmm  dots  as  a  model  of  a  periodic  nonlinear  system .  Such  system  can  be 
realized  in  a  quanmm  wire  with  strongly  varying  concentration  of  magnetic  ions. 

THE  MODEL 


Let  us  consider  a  Cdi.yMnyTe  quantum  wire  with  an  effective  radius  ro  normal  to 
the  growth  direction  the  z-axis.  The  wire  is  built  with  unit  cells  of  periodicity  a..  The  unit 
cell  of  this  wire  consists  of  a  dot  of  width  d  with  low  Mn  concentration  (y  -  0. 1),  and  a 
barrier  of  width  b  and  a  high  Mn  concentration  (y  -  0.7).  in  the  barrier  the 
antiferromagnetic  interaction  between  Mn  ions  dominates  and  we  expect  the  barrier  to  be  in 
a  "spin  glass"  phase.  The  dot  with  a  low  Mn  concentration  is  assumed  to  be  in  a 
paramagnetic  phase.  In  the  "mean  field"  approximation,  the  effective  Hamiltonian  for  the 
dot  can  be  written  asW: 


H  = 


-Vo-V„,B5/2 


V 


(1) 


The  first  term  is  the  kinetic  energy,  Vq  is  the  barrier  height  (energy  being  measured  from 

the  top  of  the  barrier),  tXPNqJz  is  the  magnetic  potential  with  pNo  being  the 

^0 


exchange  energy  (-  880  raeV  for  the  valence  band)  %  is  the  average  effective  concentration 
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Mnionsaisd  Jz  =  — isdie^imoffieecaaieisQxto).  IlieBdDooinfaacooaBsQ 

describes  the  paiaiiiagiieoc  soscqidbifi^  of  ^  3yfa  ioo  «idi  sfiin  SC  in  the  efiEEcnve 
magnetic  field  Beff^  die  c^Ecctne  te^^)eIallle  Teff.  The  efiGECtive  fidd  Bcffacring  on  the  ion 

is  a  som  (tf  dK  exfemal  fidd  Bo  and  the  exchange  field  Bex  =  jpNoJzlt|r(z)P /No  wiudt 

dq>ends  ni  the  prolMbifiiyly(z)P  of  the  earner  bemg  in  the  posiiiooirf  die  magnetic  ion. 
No  is  the  mimber  of  cations  per  unit  vdume.  Tim  wavefimetioo  for  the  carrier  in  the  wife 

is  taken  in  the  fonn  y(r.z)  =  ^z).  Subsiitiding  the  wave  fimetion  tirinto 

Eq.  1,  and  retaining  only  linear  lenns  in  the  expanskm  Bs^xK477x.  gives  asimple 
nonlinear  Sdirodinger  equation  fm^z)  whidh  can  be  written 


2m  az' 


T^z)-|vo+VB+(~pNoJz/nc^^z)l^|4(z)  =  E#(z)  intheweH 


*2  32 

and - y^z)  =  E^z)  in  the  barrier.  (2) 

2m  32^ 

ifere  He  is  the  number  of  cations  in  the  welL  Vb  =  0.467  gMolteBo^Teff  is  the  potential 
due  to  the  external  magnetic  field  vdiile  the  last  term  is  the  self-consistent  potential  due  to 
the  exchange  field.  In  die  following  we  ^lall  lestrict  ourselves  to  zero  external  magnetic 
field  and  nairow  dots.  To  understand  the  combined  effects  of  nonlinearity  and  periodicity 
it  is  sufficient  to  replace  the  effective  potential  of  the  well  by  a  suitable  chosen  delta 
function  potendaL  The  resulting  Schrodinger  equation  for  a  wire  with  N  dots  now  takes 
the  final  form: 


^zjvd  -»-adal(j>(z)l^|5(z  -  /a)|<{>(z)  =  E0(z) 


(3). 
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TIk  eae^  in  Eq3  is  measiaed  ia  de  effieciiwe  Rydbo^s  Ry  aad  kn^dis  in  efifective  Bohr 
GMfi  SQ.  For  hides  in  CdTe  widi  an  efifeciive  mass  OJ  of  die  election  mass.  Ae  effective 

Rydberg  Ry- 72  meV  and  die  Bohr  iadiiisao=  10  A.  In  Eq.  3  the  function  ^  is 
dhnensionless  (#  -»  .  and  die  pommalV  has  been  dwsen  to  iqnoduce  the  bound 

stmettf  a  single  well  of  vndifad  and  banierhd^Vo.  FbrV(plRy.d=:^A,as40A. 
k^Teff=5meV  we  esdmaie  V = 0.69  Ry .  The  coupling  constant  a  dqiends  on  die  number 
Mn  kns.  and  for  10  catioos  per  doc  we  estimate  cc^lO'^Ry. 

describes  an  air^  of  coiqiled  quannmi  dots .  Because  of  the  nonlineari^.  die 
e^enstates  of  EqJ  cannot  be  clasafiedaccmdingu)  the  Bloch  sdienie.  Weproceedby 
integrating  Eq3  across  the  /-A  singularity.  Requiring  the  continnity  of  the  wavefunction 
4(f-)  =  and  writing  the  energy  E  of  a  panicle  as  E^^-one  obtains  a  nonlinear 

conqikx  map  lelaiing  wavefitnctioos  on  successive  layers: 

/•  ^  VdasinOra)  ada^sin(ka),  oA 

hi  the  absence  of  nonlinearity  (a  =  0)  the  wavefimctions  are  Bloch  states  =  exp(iKf  a) 
characterized  by  a  Blodi  index  k.  The  standard  band  structure  is  Aen  obtained  from  Eq.4 
wiA  a  single  band  for  negative  energies  and  an  infinite  number  of  bands  and  gaps  for 
positive  energies  (above  Ae  barrier).  Such  a  clasrification  is  possible  because  we  only 
need  the  relation  between  the  phase  of  Ae  wavefunction  and  its  energy.  In  Ae  nonlinear 
case  we  must  retain  Ae  information  about  Ae  phase  and  Ae  amplitude.  This  problem  is 
simplified  due  to  Ae  global  gauge  mvariance  ot  Eq.4,  i.e.  Ae  conservation  of  Ae  current 

J/  =  Consequently,  Ae  fom-dimensional  map  of  Eq.4  can  be 

reducedt^l  to  a  two-dimensional  area  preserving  map  wiA  Ae  current  J  and  Ae  energy  E 
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as  paiameteis.  The  clas^cadon  of  the  solutions  of  Eq.4  is  hence  reduced  to  the  study  of 
the  fixed  points  of  this  nonlinear  map.  and  their  basins  of  stability.  Thishasbeen 
discussed  in  detail  in  Ref.  [5].  The  main  effects  are  illustrated  by  considering  the 
tunneling  of  etectrons  along  a  wire. 

THE  TUNNELING 

Let  os  consider  a  situation  in  which  the  periodically  modulated  Cdi.^Mn^^Te  section 
of  die  wire  with  N  dots  is  inserted  into  the  CdTe  wire.  The  electrons  traveling  in  a  wire 
will  be  transmitted  with  the  probability  ITl^  and  reflected  with  the  probability  IRI^.  Those 
electrons  which  ate  transmitted  have  traveled  through  a  self-consistent  band  structure  due  to 
the  exchange  interaction  with  magnetic  ions.  In  our  model  without  exchange  interaction, 
tunneling  can  occur  if  the  energy  of  the  transmitted  particle  is  within  the  allowed  energy 
spectrum  of  the  Bloch  band  irrespective  of  the  amplitude  of  the  wavefunction  or  the  flux  of 
electrons.  This  is  no  longer  true  in  the  nonlinear  case. 

Let  us  first  consider  a  single  dot  centered  around  z=0  We  are  interested  in  a  state 
trapped  in  the  dot  The  wavefunction  for  this  state  has  a  form  (jKz)=Aexp(-qlzi)  with  energy 
E=-q2  It  is  easy  to  see  from  Eq.3  that  the  corresponding  energy  is  given  by  E=-{  Vd/(2- 
(xad)}2  Increasing  the  nonlinearity  a  lowers  the  energy.  The  first  nontrivial  case 
corresponds  to  two  coupled  dots.  In  this  case  there  are  two  states.  In  Fig.l  we  show  the 
structure  with  two  dots  in  the  resonant  tunneling  geometry.  The  transmission  problem  is 
solved  in  the  usual  way:  we  assume  that  the  wavefunction  at  the  end  of  the  wire  has  the 
form  of  the  plane  wave  ^(z)  =  Texp(ikz),  where  k  is  the  wavevector  corresponding  to  the 
energy  E=-q2,  and  T  is  the  amplitude  of  the  transmitted  wave.  We  next  iterate  the 
wavefunction  backwards ,  matching  appropriately  to  the  Cdi.xMnxTe  barriers.The  inset  (a) 
shows  the  transmission  coefficient  as  a  function  of  energy  of  incident  particles  in  the 
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absence  of  nonlinearity.  Two  peaks  correspond  to  two  resonant  states  of  a  coupled  dot 
system.  In  the  presence  of  a  nonlinear  interaction  the  intensity  of  transmitted  particles 
becomes  a  multivalued  fimction  of  the  intensity  lli^  of  the  incident  particles .  as  shown  in 
the  inset  (b).  Fig.l.  Hence  the  usual  transmission  coefficient  ITI^/ill^  depends  on  both  the 
energy  of  the  particle  and  transmitted  intensity.  When  the  number  of  dots  N  increases  the 
energies  form  a  band.  The  transmission  coefficient  becomes  significant  for  the  incident 
particle  energy  within  the  allowed  energy  band  of  the  N  dot  array.  In  Fig.2  we  show  the 
transmission  spectrum  for  a  linear  ( ot=0 ,  dashed  line )  and  nonlinear  ( otsO.Ol ,  solid  line 
)  array  of  10  dots .  The  linear  transmission  shows  an  almost  uniform  spectrum  of  10  peaks 
corresponding  to  the  10  possible  states .  The  nonlinear  spectrum  is  shifted  to  lower 
energies  and  shows  only  3  distinct  and  well  resolved  peaks.  This  illustrates  the  the 
opening  of  gaps  i.e.  forbidden ,  nontransmiiting  energy  regions  within  linearly  allowed 
band. 

The  qualitative  features  of  the  transmission  problem  are  given  by  the  "phase 
diagram"  in  parameter  space  of  the  energy  E  and  amplitude  ITl  of  the  transmitted  particle. 
To  determine  whether  particle  was  transmitted ,  we  iterate  the  wavefunction  backward.  The 
solutions  which  do  not  grow  exponentially  are  considered  to  be  transmitting.  In  this  way 
we  can  map  out  the  regions  of  parameter  space  with  a  finite  transmission. 

In  Fig.3  we  show  such  a  diagram  corresponding  to  particles  incident  on  the  array  of 
10  coupled  dots  for  which  transmission  spectrum  is  shown  in  Fig.2.  The  daiic  areas 
correspond  to  transmitting  states.We  see  clearly  that  the  transmitting  region  breaks  down 
into  three  distinct  regions  as  shown  in  Fig.2.  Hence  the  transmission  via  flux  carrying 
states  opens  gaps  in  energy  spectrum  of  the  linear  theory.  The  complex  nature  of  this  phase 
diagram  is  associated  with  the  anaiysis^^)  of  the  nonlinear  dynamical  map  given  by  Eq.4. 
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In  summary,  we  have  shown  th^  semimagnetic  semiconductor  quantum  dots 
should  exhibit  interesting  features  due  to  strong  magnetopolaronic  effects.  The  tunneling 
studies  should  demonostrate  the  multistability,  hysteresis,  and  the  opening  of  gaps  in  the 
energy  spectrum. 
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Figure  captions. 

Rg.l.  The  schematic  picture  of  the  energy  diagram  of  two  quantum  dots  ( dashed  lines )  in 
a  scattering  geometry.  The  inset  (a)  shows  the  transmission  coefficient  m2/IIl2  as  a  function 
of  energy  E.  The  inset  (b)  shows  the  incident  intensity  lll^  as  a  function  of  transmitted 
intensity  ITl^  in  the  nonlinear  case  ( ot=.01 ). 

Rg.2.  The  transmission  coefficient  ITl2/iI|2  as  a  function  of  energy  E  for  10  dots :  linear 
spectrum  -  solid  line;  nonlinear  spectrum  ( a=.01  )-dashed  line. 

Rg.3  The  "phase  diagram"  ota2|Tl2  vs  E  of  parameters  corresponding  to  transmitted 
particle  with  energy  E  and  amplitude  IT  l2 . 
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abstract 

We  investigate  the  tunneling  of  photons  in  a  quasi  one  dimensional  array  of  polarizable 
quantum  dots  using  Maxwell-Bloch  equations.  This  allows  a  first  principle  study  of 
nonlinear  polaritons  in  saturable,  dispersive  and  absorptive,  periodic  medium.  The 
nonlinear  field  equation  for  the  steady  state  generates  linearized  excitations  which  form 
photonic  bands  and  nonlinear  excitations :  gap  solitons.  The  possibility  of  manipulating  the 
photonic  gaps  and  gap  solitons  by  separately  contacting  the  dots  is  examined. 
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The  interaction  of  photons  with  electronic  excitations  in  an  artificially  structured 
dielectric  medium  leads  to  many  interesting  and  important  phenomena;  photonic  gaps  and 
gap  solitons^l'^)  providing  good  example.  The  modulation  of  the  dielectric  medium  is  on 
the  order  of  the  wavelength  of  light ,  typically  much  greater  than  the  length  scale  associated 
with  both  individual  atoms  and  their  separation.  Hence  a  classical  description  of  the 
medium  in  terms  of  its  polarizability  is  sufficient  If  one  considers  the  interaction  of 
individual  atoms  with  the  field ,  the  quantum  mechanical  description  of  this  interaction 
leads  to  many  interesting  phenomena  such  as  Rabii  oscillations,  photon  echos,  and  self- 
induced  transparency^^!.  The  dielectric  medium  and  individual  atom  phenomena  can  be 
combined  by  creating  arrays  of  mesoscopic  atomsW  ( quantum  dots )  frozen  in  a 
semiconductor  matrix.  A  typical  separation  a  between  dots  can  be  comparable  to  the 
wavelength  corresponding  to  the  lowest  radiative  transition  of  the  dot  at  frequency  mo. 
Hence  one  must  describe  self-consistendy  spatial  and  temporal  collective  behaviour  of  the 
field  and  quantum  dots.  This  behaviotir  can  be  modified  by  selectively  contacting  individual 
dots  and  changing  their  equilibrium  state  by .  for  example,  electrically  injecting  electron 
hole-pairs.  To  describe  the  essential  features  of  such  a  complex  system  we  shall  adopt  a 
model  one  dimensional  array  of  two-level  dots.  Each  quantum  dot  is  characterized  by  its 
complex  polarization  P(x,t)  and  population  inversion  &i(x,t).  The  time  t  is  measured  in 
units  of  characteristic  transition  frequency  mo*!  and  distance  x  in  units  of  the  separation  of 
dots  a. .  The  collective  state  of  quantum  dots  and  the  linearly  polari^d  complex 
electromagnetic  field  E(x,t)  is  described  by  the  Maxwell-Bloch  equations^^*^); 
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(1). 


V  V 

/»=-i(l-jr)P+^(£  +  £*)5n 

5/t,  =  -i(E + £*)(?*  -  P)  -  nSn  -  5n^) 

Here  v  is  the  effective  velocity  of  light,  y  and  F  are  the  inverse  of  relaxation  rates  of 
polarization  and  population  inversion,  respectively,  and  5n0  is  the  equilibrium  population 
inversion  in  the  absence  of  the  field.  Sn^=:l  corresponds  to  the  ground  state  being  fully 
occupied  while  8n0=-l  corresponds  to  fully  inverted  state.  Different  Sn®  might  be 
obtained  by  e.g.  separately  contacting  individual  dots  and  electrically  injecting  electron-hole 
pairs.  The  polarization  P(x,t)  is  coupled  to  the  field  E(x,t)  via  the  spatially  dependent 
coupling  constant  ^  (proportional  to  the  dipole  moment  of  the  dot )  of  the  form 

N 

(jc)  =  5(x  -  /) .  All  parameters  of  the  theory  can  be  determined  experimentally  by 

<•1 

studying  the  linear  response  of  the  system. 

The  Maxwell-Bloch  equations  provide  a  full  description  of  the  spatial  and  temporal 
behaviour  of  the  field  and  the  dots.  In  the  spatially  homogeneous  medium  ( the  Jaynes- 
Cununings  model )  the  full  solution  of  the  Maxwell-Bloch  equations  exhibits  chaotic 
behaviour^®.  The  spatio-temporal  behaviour  for  a  pulse  width  shorter  than  the  relaxation 
time  in  a  homogeneous  medium  exhibits  seif- induced  transparency^^.  Even  in  a 
homogeneous  medium  the  nonlinear  interactions  lead  to  nontrivial  selection  rules  for  the 
solitary  waves(8) .  We  shall  leave  the  temporal  behaviour  in  the  coherent  regime  to  a 
separate  presentation  and  concentrate  here  on  spatial  behaviour  of  pulses  long  enough  for 
transient  effects  to  be  unimportant  In  the  rotating  wave  approximation  the  stationary 
solution  P(x)  of  the  Bloch  equation  for  oscillating  fields  E(x,t)=exp(-i(i)t)E(x), 
P(x,t)=exp(-io)t)P(x)  relates  the  polarization  P(x)  to  the  local  field  E(x)  via  the  field 
dependent  susceptibility  k(E): 
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This  complex  susceptibility  reflects  the  tnmsition  from  saturable  dispersive  to  saturable 
absorptive  nonlinearity  as  the  frequency  sweeps  through  the  resonance  in  the  unbiased  dot 
Note  that  depending  on  the  value  of  equilibrium  population  inversion  the  susceptibilty  of 
the  individual  dot  can  change  from  absomdon  to  gain.  In  the  steady  state  the  nonlinear  field 


equation  determines  the  spatial  distribution  of  the  field: 

E^fxl+k^Efx)  =  -k2^^(x)K(E)E(x)  (3). 

The  wavevector  k=oyv  corresponds  to  the  propagation  of  the  field  between  the  dots.  It  is 
convenient^^l  to  define  the  field  degrees  of  fieedom  in  terms  of  its  amplitude  A(x)  and 
phase  (j»(x) ;  E(x)=:A(x)exp(i(j)(x)) ,  and  to  identify  them  with  coordinates  of  a  fictitious 
particle.  The  spatial  coordinate  can  now  be  identified  with  time.  The  corresponding 
momenta  are  defined  as  p=Ax  and  j=A2<|>x  .  The  variable  j  is  just  the  energy  flux: 
j  =s  -ii  1 2){£*£,  -  ££*} .  Tlie  equation  of  motion  for  the  field  now  takes  a  very  simple  and 


intuitive  form: 

P.  =  -*'A+4y-it'^'(jc)Re(v(A)M 
A 

(4). 


The  first  equation  describes  a  harmonic,  isotropic  oscillator  in  the  presence  of  a  time 
(tependent  nonlinear  force  proportioRfU  to  the  dispersive  part  of  the  susceptibility  and  in  the 
presence  of  a  time  dependent  "centrifugal  force"  ( the  term  proportional  to  j^/A^ ).  The 
variable  j  can  be  interpreted  as  the  angular  momentum  of  the  oscillator.  The  second 
equation  describes  the  evolution  of  the  momentum  j  and  is  governed  purely  by  the 
absorptive  part  of  the  susceptibility.  Clearly  from  the  form  of  the  susceptibility  3l.2,  and 
Eq.4  it  is  obvious  that  the  losses  lead  to  a  collapse  while  gain  leads  to  a  growth  of  the 
centrifugal  barrier  of  the  harmonic  oscillator.  In  the  absence  of  losses  ( gain )  j=const  and 


32-21 


lheca»e<rfsnMnami^ttiie$(noiisainabieiMnBneariqr).Eq.41a$l>eensai>fiedittdeail 
in  Ref(9) .  Fidkmii^  Rd(9}  the  scrioiioo  of  E4-4  cao  be  wiinen  in  a  focm  a  coiii(na  3- 

dhiM^ifinal  ntwilmfar  map- 

Q^2c-skpt^it{K)~Q 

A 

T=J-ksfi^it'{K)IC  (5). 

niieie  R=a2  .  Q=s{VkA4€ .  s=sm(k).  c=cos(k).  and  k”  and  x*  stand  for  die  reM 
and  imagioaiy  pans  of  the  sosceptibiliiy.  EqJ  aUovs  for  tbe  analysis  of  tbe  glsbai 
piDpenies  of  d]e  mxilioearwave  eqoaiioo. 

Let  us  first  cousider  the  st^iiliQf  ol  a  plane  wave  E(x)=Texp(ilcx).  Starting  with  the 
initiai  condition  R=:T^.  Q=cos(k).  J=T^sin(k)  we  iterate  the  map  given  by  Eq  N  times. 
The  orbits  vliose  amplitude  R  remaiDS  within  a  fixed  radios  Rmaz  fiom  the  initial  copditUMi 
are  classified  as  stable.  Hence  we  can  create  a  phase  diagram  (T2.ci>)  of  staMe  or* 

Hg.l  we  show  a  phase  diagram  amesponding  to  parameters  N=1 1.  ^l.l . 

y=r=.01.  Rmax'-O^  •  All  dots  in  the  absence  v.  the  field  £(x)  are  in  their  ground  state. 
The  black  crosses  in  Fig.l  correspond  to  stable  orbits.  Hg.l.  ^ows  two  wide  photonic 
gaps  on  both  sides  of  the  resonant  frequency.  These  gaps  begin  to  fill  up  woch  stable  states 
-gap  solitons.  The  region  in  the  vicinity  of  the  resonant  frequency  is  quite  complex  ( 
disordered )  for  low  ampiimde  of  the  plane  wave.  As  the  amplitude  T^  increases  the 
absorptive  losses  saturate  and  plane  waves  with  larger  amplitudes  become  more  stable. 
Eq.5  and  the  phase  diagram  it  generates  can  be  used  to  study  the  transmission  experiment 
However .  we  shall  adopt  a  more  standard  ( but  entirely  equivalent )  approach  to  the 
transmt  sion  problem  in  terms  of  transfer  matrices. 

We  decompose  the  field  E(x)  between  {n-l)-st  and  (n)  dot  in  terras  of  forward  (Aq. 
l)  and  backward  (Bn-i)  propagating  components:  E(x)=An.iexp(ik(x-{n-l))+Bn-iexp(-ik(x- 
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(s>1))l  FoDowing  Ref(10)  we  cast  die  problem  of  finding  dK  scdotkms  to  Eq3  in 
die  mnsfiEr  maiiix  B1  defined  by: 


(6). 


Tbe  lisear  ofNkad  propenies  are  obtained  from  Ibis  transfer  matrix  osing  lineaiiKd 
susceptibility.  The  ptdaeiUMi  Bloch  waveveoorQ  can  be  easily  computed  fiomdietiaceof 
dre  tiansfo  matrix  M:  2co£Q=s(M).  The  effect  of  quantom  dots  is  to  renonnalize  the 
{^Kxon  group  velocity  at  the  bottOTKff  the  pdaiiton  band :  2.  and  to 

introduce  gaps  in  the  poiaiiion  spectrum.  The  lowest  polaiiton  gap  falls  in  the  fieqoency 
range :  +2(1-  v;r))  <  m<  mt  .  This  allows  to  detennine  two  parameteisof  the 

theory:  v  and  For  parameters  cone^oding  to  Fig.  1  the  lowest  polariton  band  is  in  the 
frequency  range  (0.36.0.63) . 

To  calculate  the  transmission  coefficient  we  start  with  a  plane  wave  solution  at  the 
end  of  the  sample  in  the  form  of  a  transmitted  wave  TexpOkx).  This  solution  is  iterated 
backwards  and  matched  across  the  first  layer  to  the  wave  composed  of  incident  I  and 
reflected  R  components:  E(x)=:Iexp(ikx)+Rexp(-ikx).  The  transfer  matrix  across  the  first 
layer  is  slightty  different  from  the  transfer  matrix  M .  In  this  way  we  can  detennine  the 
incident  wave  intensity  as  a  function  of  transmitted  intensity  fTI^  for  each  fiequency  m 

and  number  of  layers  N.  Several  such  functions  for  selected  frequencies  are  shown  in 
Fig.2.  For  fiequencies  in  the  photonic  band  of  the  linear  theory  the  transmitted  intensity  is  a 
linear  function  of  the  incident  intensity.  For  higher  incident  intensity  the  transmitted 
intensity  ITl^  becomes  a  multivalued  function  of  the  incident  intensity  ill^  in  a  usual  way. 

For  fiequencies  in  the  gap  of  the  linear  theory  the  incident  intensity  III^  goes  toward  a 
constant  as  the  transmitted  intensity  vanishes.  Hence  a  finite  incident  intensity  is  needed 
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for  a  Snife  nansmisaoa  to  occur.  The  photonic  gaps  can  be  penetrated  by  a  beam  of 
sufficient  intensi^  by  propagating  a  solitary  wave. 

The  situation  in  Hg.2.  resembles  bifurcation  pbenmiena  in  equlibtium  phase 
oansitions:  die  multivaluedness  in  the  band  of  thetxy  is  reminiscent  of  first  order 
{diase  tranations.  and  the  multivaluedness  in  the  gap  is  reminiscent  of  second  order  phase 
transiticMis.  Naturally  the  behaviour  of  the  sysmm  will  be  different  for  increasing  or 
decreasing  iixddent  intensity .  Even  mcxe  complex  behaviour  is  expected  in  the  vicinity  of 
the  resonance  ( see  Fig.1. ) .  Is  it  then  possible  to  define  a  meaningful  transmission 
coefficient  as  a  function  of  fiequncy  ca?.  We  propose  to  characterise  the  system  by  a 
threshold  transmission  coefficienL  The  threshold  transmission  coefficient  corresponds  to 
the  ratio  of  the  transmitted  intensity  to  the  minimum  incident  intensity  required  for  finite 
transmission.  In  short .  for  each  frequency  we  construct  the  incident  intensity  as  a  function 
of  the  transmitted  intensi^  and  find  a  minimum.  The  threshold  transmission  coefficient 
cone^nds  to  this  minimum.  We  show  the  linear  transmission  coefficient  (Fig.3a)  and  the 
threshold  transmission  coefficient  (Hg.3b)  as  a  function  of  frequency  for  the  same  set  of 
parameters  as  in  Fig.  1  and  Fig.2.  The  threshold  transmission  coefficient  illustrates  nicely 
the  effects  of  nonlinearity  in  terms  of  filling  up  the  polariton  gaps  with  nonlinear ,  soliton 
like  excitations.  Fig.3c  illustrates  briefly  the  effect  of  altering  the  equillibrium  value  of 
inversion  of  every  other  dot  from  5n0=:l  to  5nO=0.  This  essentially  removes  half  of  the 
dots  and  leads  to  changes  in  the  transmission  coefficienL 

In  summary,  we  studied  Maxwell-Bloch  equations  for  a  periodic  array  of  two  level 
quantum  dots.  The  spatial  distribution  of  the  electromagnetic  field  "dressed  up"  by  induced 
polarization  of  quantum  dots  is  described  by  a  nonlinear  wave  equation.  Global  solutions 
of  the  wave  equation  are  equivalent  to  the  study  of  a  three  dimensional  nonlinear  map 
representing  all  irreducible  variables.  This  nonlinear  map  admits  stable  linearized  solutions 
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as  polariton  bands.  The  nonlinear  solutions  conesponding  to  periodic  orbits  of  the 
describe  solitons.  Energy  carrying  solitons  lead  to  transmission  in  the  polariton  gaps  and 
gaps  in  the  polariton  bands.  Polaritons  and  solitons  can  be  manipulated  by  separately 
contacting  individual  dots  leading  to  a  tunable  (^tical  system. 

The  Coulomb  effects  associated  with  biexcitons  in  individual  quantum  dots(^ri 
neglected  here  can  be  incorporated  into  a  more  accurate  susceptibility. 
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Figure  captions:  . 

Fig.l  Stability  diagram  (T^ko  )  of  the  plane  wave. 

Fig.2.  Incident  intensity  lli^  vs  transmitted  intensiQr  for  a  set  of  frequencies 
Q)=.05,10 . 70 

Fig.3.  Threshold  transmission  coefficient  as  a  function  of  &equency  co  for  (a)  linear  system 
(b)  nonlinear  but  unpolarized  system  (c)  polarized  nonlinear  array. 
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I.  Abstract 


The  adducts  of  HF  and  HCl  with  aluminum  hydrides,  aluminum  halides,  aluminum  hydrohalides, 
aluminum  hydroxides  and  aluminum  oxides  have  been  studied  using  both  semi-empirical  and  ab  initio 
molecular  orbital  methods.  When  the  electron  rich  region  of  HF  or  HCl  is  placed  between  2.5  -  3.5  A 
above  the  aluminum  atom,  Lewis  Acid/Base  adducts  are  generally  found.  In  a  few  cases,  the  starting 
geometry  rearranged  to  give  a  hydrogen  bonded  structure.  In  fewer  cases,  chemical  reactions  were 
observed.  The  structures  observed  are  reported  and  briefly  discussed. 

11.  Introduction 

Molecular  Orbital  Theory  has  been  able  to  describe  chemical  bonding  for  several  decades,  first  for 
pi-orbitals  using  Hiickel  Theory,  then  for  valence  orbitals  using  semi-empirical  methods  and  finally,  as 
computation  power  increased,  for  all  orbitals  by  ab  initio  methods.  For  the  past  two  decades,  ab  initio 
methods  have  been  available  and,  in  the  main,  the  size  of  the  system  which  could  be  studied  was  limited 
by  computer  power.  Within  the  past  five  years  the  availability  of  supercomputers  has  permitted  the  study 
of  large  systems  (more  than  100  electrons)  at  a  primitive  level  or  the  study  of  smaller  systems  in  more 
sophisticated  ways.  Thus,  one  could  study  the  van  der  Waal's  interaction  between  two  He  atoms  two 
decades  ago  but  only  within  the  past  five  years  could  weak  interactions,  say  of  the  order  of  3  to  30 
kcal/mole,  be  studied  and  realistically  compt^  with  experiment  We  can  now  handle  very  large  systems, 
e.  g.  DNA,  in  a  very  crude  way. 

Weak  interactions  between  molecules  can  be  cbasified  as  van  der  Waal's  type,  in  which  no 
electrons  are  transferred  (e.g.  induced  dipole-induced  dipole  or  London  Forces  and  dipole-dipole 
interactions,  of  which  hydrogen  bonds  is  the  best  example).  A  second  type  of  weak  interaction  is  the 
formation  of  Lewis  Acid^ase  adducts  in  which  an  electron  rich  region  of  one  molecule  can  donate  a  small 
fraction  of  its  electrons  to  an  electron  poor  region  of  anoth(v.  The  classic  intermolecular  example  is  borane 
carbonyl,  H3B  -  CO.  The  classic  intramolecular  example  diborane  itself,  where  the  empty  p  orbitals  on 
the  two  borane  molecules  share  the  electrons  involved  i  ,i  the  B-H  bond  on  the  other  borane  molecub. 
This  symmetric  interaction  is  often  called  a  three-cenu.;'  two-electron  bond.  Boric  acid,  B(OH)3  is  a 
monobasic  acid  because  it  does  not  donate  protons  but  ai  epts  OH*  and,  since  there  is  only  one  empty  p 
orbital,  the  molecule  is  capable  of  neutralizing  one  OH*.  ( ’learly  Lewis  Acid/Base  interactions,  especi^ly 
those  that  exceed  thermal  energy  (3/2  RT,  0.9  kcal/mole  at  room  temperature)  dictate  the  nature  of  the 
molecular  entity  that  is  present.  This  knowledge  is  fundar’vsntal  to  our  understanding  of  the  chemistry  of 
these  systems. 

The  author  has  become  more  and  more  involved  ir  boron  chemistry  since  1985  when  he  first 
suggested  that  silylborane  was  involved  in  the  deposition  of  p-doped  a-silicon.^  There  followed  a  long 
series  of  papers  dealing  with  fluorinated  silyl  boranes.2-4  Recently,  the  author  has  shown  that  BH3  was 

capable  of  forming  important  adducts  with  3p  O  atoms,  with  dioxygen,  and  OH  radicals.^  These 
adducts  may  play  an  important  and  hitherto  unsuspected  roles  in  the  oxidation  of  diborane. 

Because  borane,  allane,  gallane,  etc.  constitute  higher  energy  fuels  they  are  of  interest  to  the  Air 
Force,  especially  borane  and  allane.  Having  published  exploratory  calculations  of  the  adducts  of  borane, 
it  was  natural  for  the  author  to  expand  these  calculations  to  adducts  formed  by  aluminum  compounds.  An 
impetus  to  do  these  calculations  at  the  USAFA  was  provided  because  aluminum  oxides  constitute  an 
important  product  in  the  exhaust  of  the  solid  state  booster  rockets  for  the  Space  Shuttle.  Since  HCl  was 
also  in  the  exhaust,  it  was  only  natural  to  study  the  adducts  of  aluminum  compounds  with  HCl  and,  since 
HF  is  computationally  more  convenient,  with  HF  as  well.  Also,  although  we  did  not  appreciate  this  when 
we  first  came  to  the  F.  J.  Seiler  Laboratory,  there  was  considerable  interests  in  the  application  of 
aluminum  halide  salts  as  solvents  for  high  energy  electrochemical  cells.  These  salts  were  eutectic  mixes 
prepared  by  mixing  AICI3  with  a  Cl*  containing  salt,  such  as  NaCl  or  RCl,  where  R  is  an  organic 
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quartenary  ammonium  salt.  The  reaction  between  AICI3  and  Cl*  is,  of  course,  a  Lewis  Acid/Base  reaction 
such  as  considered  in  this  study.  Thus,  the  proposed  study  of  the  HF  and  HCl  adducts  of  aluminum 
hydrides,  halides,  hydroxides  and  oxides  was  of  some  interest  to  the  Air  Force. 

in.  The  Methods 

Two  molecular  orbital  methods  are  available  for  studying  the  the  electronic  properties  of 
molecules,  the  semi-empirical  method  in  which  the  two  electron  integrals  are  approximated  using 
experimental  data  (parameterized),  and  the  ab  inito  method  in  which  nothing  is  parameterized.  At  the  F.  J. 

Seiler  Laboratory,  the  only  computers  conveniently  available  were  a  cluster  of  ^-Vaxes.  These  machines 
can  be  applied  to  ab  initio  calculations  but  they  are  so  slow  that  such  usage  is  not  practical  for  the  the 

majority  of  molecules  studied  here.  |i-Vaxes  are  much  better  suited  to  semi-empirical  calculations  and  one 
of  the  best  programs,  MOPAC,  was  in  fact  developed  at  the  Seiler  Laboratory  under  the  auspices  of  Dr.  J. 
J.  P.  Stewart. 

Before  we  fully  appreciated  the  virtues  (and  limitations)  of  the  MOPAC  program  we  explored  the 
adducts  -sing  the  computationally  "inexpensive"  3-21 G*  basis  set  and  tite  ab  initio  program.  After 
optimization  with  this  basis  set,  the  molecules  (adducts)  were  reoptimized  using  the  6-3 IG*  basis  set. 

Then,  at  this  level,  correlation  was  taken  into  account  using  fourth  order  M(j)eller-Plesset  perturbation 
theory,  MP4(SDTQ).  Energies  so  determined  are  generally  accepted  to  be  within  1-2  kcal/mole  of 
experimental  values.  Later,  the  3-2 IG*  screening  was  abandoned  in  favor  of  MOPAC  screening.  We 
were  sufficiently  impressed  with  the  speed  and  versatility  of  the  MOPAC  program  that  we  plan  to  adopt  it 
for  all  future  preliminary  screening. 

It  is  important  to  note  that  all  calculations  were  started  with  the  electron  rich  end  of  the  HF  and  HCl 
molecules  in  the  empty  p  orbital,  two-three  angstroms  above  the  plane  of  aluminum  compounds  and, 
hence,  in  the  region  where  Lewis  Acid/Base  adducts  were  to  be  expected.  The  calculations  were, 
therefore,  biased  toward  the  anticipated  result.  As  will  be  seen,  in  most  cases  the  expected  adduct  was 
found.  However,  there  were  a  few  interesting  exceptions  where  the  molecule  rearranged/reacted  rather 
than  form  an  adduct.  There  was  insufficient  time  and  insufficient  computer  resources  to  explore  each 
individual  surface.  Hence,  we  cannot  claim  that  we  have  found  the  globaJ  minimum  for  each  interaction. 
However,  in  every  case  we  determined  the  vibrational  spectrum  of  the  stationary  point  reached  to  ensure 
that  the  programs  had  generated  true  minima  and  not  transition  states. 

IV.  Division  of  the  Problem 

It  was  quickly  apparent  that  a  study  of  the  HF  and  HCl  adducts  of  Al-H(singlet/triplet),  AIH2, 
AIH3,  Al-F(singlet/triplet),  AIF2,  AIF3,  Al-Cl  (singlet/triplet),  AICI2,  AICI3,  AIH2F,  AIHF2,  AIH2CI, 
AIHCI2,  AIH2OH,  A1H(0H)2,  A1(0H)3,  0=A1-H,  0=A1-F,  0=A1-C1,  AlO,  and  AI2O3  was  a  rather 
prodigious  undertaking.  This  required  over  50  optimizations,  frequency  analyses,  and  MP4(SDTQ) 
calculations.  [Some  of  these,  e.g.,  HC1-A1C13,  involved  too  many  electrons  (e.g.  82)  to  be  performed 
using  anything  smaller  and  slower  than  an  IBM  3090,  such  as  located  at  Oklahoma  State  University,  or 
the  CRAY-2  computer,  located  at  the  National  Center  for  Supercomputing  Applications  (NCSA), 
University  of  Illinois.]  Therefore,  we  decided  to  break  the  calculations  into  two  projects,  one  involving 
the  HF  and  HCl  adducts  of  aluminum  hydrides  and  halides,  the  other  the  HF  and  HCl  adducts  of 
aluminum  hydroxides  and  oxides.  The  manuscript  for  the  first  paper  is  being  prepared  concurrently  with 
this  report.  The  manuscript  for  the  second  paper  will  be  finished  early  this  F^. 

V.  Results 

It  would  overly  lengthen  this  report  to  discuss  all  of  the  results  in  hand.  Hence,  we  shall 
summarize  the  observations  here  and  refer  the  reader  to  the  respective  papers  for  additional  details.  Since 
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HF  and  HCl  are  dipolar  and  many  of  the  ftagmei.fs  studied  were  polar,  the  geometry  of  the  dipole-dipole 
interaction  is  not  very  different  from  that  of  the  Le  vis  Acid/  Base  adducts.  However,  one  sure  indication 
of  electron  donation  (adduct  formation)  was  the  distortion  of  the  aluminum  compound  from  the  sp^ 
hybridization  (planar)  toward  the  sp3  hybridization  (tetrahedral).  We  show  this  generally  in  Figure  1. 

In  a  few  cases,  usually  with  fragment  aluminum  compounds,  the  HF  or  HCl  adduct  rearranged  to 
hydrogen  bond  with  the  negative  region  of  the  aluminum  adduct,  vide  Figure  2,  and  the  aluminum 
fragment  hardly  changed  during  the  optimization  process.  The  most  remarkable  of  these  structures  was 
that  for  which  X=H,  i.e.  the  aluminum  hydride  fragment.  In  this  case  a  hydrogen  bond  was  formed 
between  a  hydride  hydrogen  and  a  proton-like  hydrogen.  To  the  author's  knowl^ge,  such  a  hydrogen 
bond  has  never  been  observed  either  ^eoretically  or  experimentally.  While  structures  of  this  type  were  not 
observed  for  AIXYZ  molecules,  there  is  not  a  priori  reason  to  rule  them  out.  We  feel  that  a  thorough 
investigation  of  the  surface  of,  say,  HF  and  AIH3  would  find  such  structures.  As  mentioned  previously, 
our  studies  were  biased  toward  Lewis  Acid/Base  Adducts  and  not  toward  the  formation  of  hydrogen 
bonds.  Starting  geometries  which  favored  hydrogen  bond  structures,  i.e.,  F-H  ••H-AIH2,  would 
probably  find  such  structures. 

In  a  few  cases,  chemical  reaction  was  found  to  occur.  One  example  is  the  addition  of  HF  to  the 
aluminum  oxygen  double  bond,  illustrated  in  Figure  3.  In  addition  we  observed  both  hydrogen  atom  and 
halogen  atom  abstraction  reactions. 

For  hydroxides  and  oxides  both  adduct  formation  and  H-bonding  was  observed.  Which  process 
dominated  and  was  found  by  optimization  seemed  to  depend  upon  the  nature  of  the  acid  (HF  vs  HCl)  and 
on  the  nature  of  the  aluminum  compound.  We  illustrate  the  two  possibilities  in  Figure  4  When  X  was  a 
hydroxyl  group,  the  hydrogen  bond  product  was  commonly  form^. 

The  MP4(SDTQ)  dissociation  energies  of  the  adducts  ranged  from  3.0  kcal/mole  to  25.8  kcal/mol. 
In  general  the  HCl  adducts  were  more  weakly  bound,  presumably  because  of  their  larger  size.  It  would 
unduly  lengthen  this  report  to  go  into  the  variations  in  the  dissociation  energies  observed.  The  reader  is 
directed  to  the  manuscripts  for  structural  details,  energies,  and  specific  interpretations.  Alternately,  the 
reader  can  contact  the  author  at  Oklahoma  State  University. 

VI.  Conclusions 

1)  The  empty  p  orbitals  in  Group  IIIA  compounds  leads  to  the  formation  of  Lewis  adducts  between  HCl 
and  HF  and  the  majority  of  the  ^uminum  hydrides,  halides,  hydroxides  and  oxides. 

2)  In  a  few  cases  hydrogen  bonding,  i.e.  dipole-dipole  interaction,  was  observed.  Since  all  the  molecules 
studied  were  polar,  one  suspects  that  all  of  the  aluminum  compounds  can  form  hydrogen  bonds  with  HF 
and  HCl.  The  reason  these  were  not  all  observed  in  this  study  is  believed  to  be  a  consequence  of  the  initial 
starting  geometry. 

3)  Based  on  these  studies  one  can  infer  that  HCl  is  certainly  hydrogen  bonded  to  the  aluminum 
oxides/hydroxides  in  the  exhaust  of  the  Space  Shuttle's  solid  state  booster  rockets.  This  is  important  since 
there  is  concern  about  the  fate  of  HCl  gas  in  the  rocket  exhaust. 

4)  HCl  and  HF  should  be  highly  soluble  in  AlCl3-NaCl  melts.  These  may  offer  a  way  of  modifying  the 
properties  of  these  liquids.  Similarly,  aluminum  hydroxide,  chloroallane,  etc.  would  be  expected  to 
dissolve  in  these  melts  to  alter  the  liquid  properties. 


33-4 


VII.  Future  Work 


Although  we  (the  author  and  his  Graduate  Student,  Mr.  Marty  Wilson)  have  conducted  a  broad 
study  of  the  HT/HCl  adducts  of  aluminum  hyc’jides,  halides,  hydroxides  and  oxides,  we  have  but  barely 
scratched  the  surface  of  this  field.  Each  of  the  50  some  adducts  deserves  a  more  detailed  study  and  a  full 
understanding  of  these  interactions  will  only  come  from  a  careful  exploration  of  each  energy  surface. 

HF  and  HCl  have  scarcely  exhausted  the  list  of  potential  electron  donors.  One  can  think  about 
water,  ammonia,  alcohols,  amines,  etc.  as  well.  Further,  since  these  adducts  may  not  be  the  global 
minima  one  should  search  chemical  reactions  in  which  new  chemical  bonds  were  formed.  For  instance,  in 
one  case  HF  was  found  to  add  to  react  with  0=A1-0H  to  produce  0=A1-F  and  H2O.  similarly,  one  must 
expect  that  H2O  reacts  with  AICI3  to  form  AICI2OH  and  HCl,  hydrogen  bonded  or  as  a  Lewis  adduct. 
The  transition  states  for  these  reactions,  none  of  which  have  been  explored,  certainly  lie  at  shorter 
alumnium  fluorine  distances  than  a  umed  in  the  starting  geometries  used  here.  Much  remains  to  be  done 
before  these  systems  are  fully  unt-  ood. 
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